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Preface 


This  textbook  is  based  on  my  lectures  to  students  at  the  Mos- 
cow Regional  Pedagogical  Institute  (1945  to  1948)  and  Moscow 
University  from  1945  on.  In  writing  this  book  we  set  ourselves 
the  difficult  task  of  treating  in  a  single  volume  the  fundamentals 
of  atomic  theory,  that  is,  Schrodinger's  nonrelativistic  theory, 
Dirac's  relativistic  theory,  the  theory  of  multielectron  atonis, 
and  the  basic  applications  of  quantum  mechanics  to  solid  state 
physics.  Our  aim  was  to  combine  the  exposition  of  general  the- 
oretical principles  with  examples  of  the  application  of  quantum 
mechanics  to  specific  problems  connected  with  atomic  structure. 
To  avoid  overloading  this  book,  we  have  abridged  the  treatment  of 
certain  specialized  topics,  but  in  such  cases  we  have  endeavored 
to  supply  references  to  standard  works  on  the  subject. 

In  most  textbooks  the  solution  of  specific  problems  with  the 
help  of  Schrodinger's  equation  is  handled  in  fairly  elegant  form. 
The  basic  mathematical  tools  required  for  this  purpose  are  a 
knowledge  of  second-order  differential  equations  and  various  spe- 
cial functions  (including  the  Hermite,  Legendre  and  Laguerre 
polynomials).  However,  applications  of  Dirac's  theory  to  specific 
problems  (such  as  the  hydrogen  atom)  are  on  the  whole  handled 
less  satisfactorily.  In  some  cases  the  calculations  are  so  long 
and  cumbersome  that  it  is  difficult  to  perceive  the  physical  mean- 
ing of  the  solutions.  In  others  there  is  no  actual  derivation  of  the 
results  or  only  a  rough  proof  is  given.  In  an  attempt  to  avoid 
these  pitfalls,  we  have  used  an  approximate  form  of  Dirac's  equa- 
tion for  our  treatment  of  the  hydrogen  atom  (Chapter  19).  This 
approximation  still  enables  us  to  obtain  the  formula  for  the  fine 
structure  of  the  energy  levels  and  the  selection  rules  (Chapter  18 
and  20).  Our  analysis  of  the  Lamb  shift  due  to  the  electron- 
positron  vacuum  is  also  somewhat  simplified  (Chapter  22). 

Several  good  problem  books  in  quantum  mechanics  are  avail- 
able, and  therefore  we  shall  consider  only  a  few  problems  chosen 
with  the  aim  of  elucidating  and  supplementing  the  general  discus- 
sion. 

The  first  part  of  this  book  was  written  jointly  by  me  and  Yu.  M. 
Loskutov,  and  the  second  part  jointly  by  me  and  I.  M.  Ternov. 
Great  assistance  was  rendered  by  M.  M.  Kolesnikova  in  condens- 
ing notes  based  on  my  lectures  on  quantum  mechanics  and  in 
preparing  the  manuscript  for  the  press.  Chapter  25  was  carefully 
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read  by  N.  N.  Kolesnikov,  who  made  a  number  of  valuable  com- 
ments. I  would  like  to  mention  the  great  pains  taken  by  S.  I.  Larin 
in  editing  the  whole  manuscript. 


A.  A.  Sokolov 


Introduction 


Quantum  mechanics  dates  only  from  the  1920's.  This  important 
branch  of  theoretical  physics  deals  with  the  fundamental  problem 
of  the  behavior  of  micropar tides  (for  instance,  the  behavior  of 
electrons  in  an  atom).  As  a  theo-.y,  quantum  mechanics  represents 
an  extension  of  classical  mechanics,  electrodynamics  (including 
the  theory  of  the  electron  and  the  theory  of  relativity) ,  the  kinetic 
theory  of  matter,  and  other  branches  of  theoretical  physics. 

Historically,  the  development  of  every  branch  of  theoretical 
physics  involves  two  main  stages.  First  comes  the  accumulation  of 
experimental  facts,  the  discovery  of  semiempirical  laws,  and  the 
development  of  preliminary  hypotheses  and  theories.  This  is  fol- 
lowed by  the  discovery  of  general  laws,  which  provide  a  basis  for 
interpreting  a  large  number  of  phenomena.  For  example,  the  first 
or  pre-Newtonian  stage  of  mechanics  consisted  of  the  discovery  of 
a  number  of  seemingly  unrelated  laws:  the  law  of  inertia,  the  law 
of  free  fall  under  the  action  of  a  gravitational  field  and  Kepler's 
laws  of  planetary  motion.  Most  of  these  laws  were  discovered  only 
after  years  of  painstaking  work  by  many  scientists.  Thus,  many 
astronomical  observations  preceded  the  discovery  of  Kepler's 
laws.  We  may  recall  the  great  efforts  of  Copernicus,  Bruno,  Gal- 
ileo, and  others  to  establish  that  the  Sun  is  the  center  of  our 
planetary  system  and  that  the  Earth  is  only  a  planet  like  Mars, 
Venus,  or  Jupiter.  It  was  only  after  working  for  fifteen  years  on 
Tycho  Brahe's  extremely  valuable  observational  data  that  Kepler 
found  the  semiempirical  laws  describing  planetary  motion.  After 
these  preliminary,  seemingly  independent  laws  had  been  estab- 
lished, Newton  was  able  to  show  that  they  all  rested  on  the  same 
theoretical  foundation.  Newton's  three  laws  of  motion  and  the  law 
of  universal  gravitation  opened  a  new  stage  in  the  development  of 
theoretical  mechanics.  One  of  the  great  triumphs  of  Newtonian 
mechanics  was  Leverrier's  prediction  of  the  existence  of  a  new 
planet,  Neptune,  from  perturbations  in  the  motion  of  Jupiter. 

In  a  similar  fashion,  Maxwell's  formulation  of  the  laws  of  elec- 
trodynamics was  preceded  by  the  discovery  of  empirical  laws  de- 
scribing various  electric  and  magnetic  phenomena.  Coulomb's 
law  of  interaction  between  electric  charges  and  magnetic  poles1 
and  the  Biot-Savart  law  of  interaction  between  an  electric  current 
and  a  magnetic  pole  were  found  by  analogy  with  Newton's  law  of 


1As  magnetic  monopoles  do  not  exist  in  nature,  Coulomb's  law  in  magnetostatics  is 
verified  by  means  of  magnetic  dipoles. 
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gravitation.  All  of  these  phenomena  were  explained  on  the  basis  of 
the  principle  of  "action  at  a  distance,"  according  to  which  one 
charge  acts  directly  on  another  through  the  intervening  space. 

After  Newton,  and  independently  of  investigations  of  electric 
and  magnetic  phenomena,  considerable  attention  was  devoted  to 
optics.  At  a  relatively  early  stage,  it  was  established  that  light 
consists  of  transverse  waves  propagating  with  a  finite  velocity 
of  c^3-10l°  cm/ sec.  The  nature  of  these  waves,  however,  re- 
mained unknown. 

All  of  these  preliminary  studies  belonged  to  the  first  stage  of 
development  of  electrodynamics:  they  prepared  the  ground  for 
Maxwell's  theory,  which  had  approximately  the  same  unifying  role 
in  electrodynamics  as  Newton's  laws  in  mechanics.  Maxwell's 
equations  provided  a  powerful  tool  for  the  investigation  of  electric, 
magnetic  and  optical  phenomena.  Maxwell's  theory  predicted  the 
existence  of  electromagnetic  fields,  which  carry  the  interaction 
continuously  from  point  to  point,  and  of  electromagnetic  waves, 
which  were  later  discovered  by  Hertz.  The  theory  of  propagation 
of  electromagnetic  waves  underlies  all  of  modern  radio  engineering. 

Another  important  result  of  Maxwell's  theory  was  aproof  of  the 
wave  nature  of  light. 

The  view  that  matter  and  electricity  have  an  atomic  structure 
was  of  considerable  importance  in  connection  with  the  appearance 
of  quantum  mechanics.  This  view  had  very  ancient  roots,  but 
remained  without  scientific  foundation  until  the  discovery  of  the 
fundamental  law  of  chemistry— the  law  of  exact  proportions.  The 
kinetic  theory  of  matter  and,  in  particular,  the  kinetic  theory  of 
gases—based  on  the  classical  Maxwell- Boltzmann  statistics— were 
important  steps  in  the  development  of  atomic  theory.  It  is  worth 
noting  that  the  classical  Maxwell- Boltzmann  statistics,  which  rests 
on  probability  theory,  cannot  be  completely  explained  in  terms  of 
Newtonian  mechanics  and  contains  certain  features  that  are  char- 
acteristic only  of  large  collections  of  particles  (for  example,  the 
irreversibility  of  certain  processes).  Statistical  methods  made  it 
possible  to  explain  a  number  of  macroscopic  properties  of  matter, 
such  as  temperature  and  specific  heat;  this  provided  indirect 
evidence  of  the  atomic  structure  of  matter. 

One  of  the  decisive  proofs  of  the  atomic  theory  of  matter  was 
the  discovery  of  fluctuations,  that  is,  statistical  fluctuations  in  the 
behavior  of  individual  molecules.  Brownian  motion  was  particularly 
important  in  this  connection,  as  it  provided  the  evidence  of  molec- 
ular movement  in  a  liquid.  Even  more  suggestive  proofs  of  the 
atomic  structure  of  matter  were  provided  by  Laue's  observation  of 
the  diffraction  of  x-rays  in  crystals  and  Aston's  mass-spectro- 
graphic  measurements  of  the  atomic  weights  of  individual  isotopes 
of  various  elements. 

From  an  analysis  of  Faraday's  laws  of  electrolysis,  Helmholtz 
showed  that  there  must  be  a  fundamental  quantity  of  electricity, 
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equal  to  4.8- 10  10  esu,  such  that  any  charge,  positive  or  negative, 
is  an  integral  multiple  of  this  charge.  Studies  of  anode  rays  indi- 
cated that  positive  charges  always  appear  as  ions;  that  is,  a  positive 
charge  is  always  associated  with  the  basic  mass  of  an  atom.  The 
lightest  positive  ion  is  that  of  a  hydrogen  atom.  It  is  known  as  a 
proton  and  it's  mass  is  nearly  the  same  as  the  mass  of  a  neutral 
hydrogen  atom.  The  carrier  of  a  negative  charge  can  take  the  form 
of  a  negative  ion  or  of  a  much  lighter  particle  known  as  an  electron. 
From  measurements  of  the  deflection  of  cathode  rays  (a  beam  of 
electrons)  in  electric  and  magnetic  fields,  it  was  found  that  the 
mass  of  the  electron  was  about  1/1836  of  the  mass  of  the  proton.2 

These  discoveries  led  to  Lorentz's  electron  theory,  which 
represents  an  interesting  synthesis  of  Maxwell's  electrodynamics 
for  a  vacuum  and  the  atomic  view  that  matter  consists  of  positive 
and  negative  charges.  In  Lorentz's  theory  the  magnetic  permea- 
bility, dielectric  constant,  and  conductivity  of  a  medium  were 
obtained  by  averaging  Maxwell's  equations  for  a  vacuum  over 
charges  and  currents  of  particles  of  the  medium.  A  conductor  was 
treated  as  a  medium  filled  with  free  electrons  or,  in  other  words, 
an  "electron  gas."  It  followed  from  Lorentz's  theory  that  the 
dielectric  constant  depends  on  the  frequency  of  electromagnetic 
waves,  whereas  in  Maxwell's  theory  it  had  been  assumed  that  this 
quantity  is  a  constant.  Lorentz's  theory  provided  an  explanation  of 
the  dispersion  of  light.  The  appearance  of  this  theory  was  accom- 
panied by  the  extension  of  electrodynamics  to  frames  of  reference 
traveling  with  constant  relative  velocities;  this  culminated  in  the 
special  theory  of  relativity.  It  is  well  known  that  all  laws  of  motion— 
whether  they  be  Newton' slaws  or  Maxwell's  equations  for  the  motion 
of  an  electromagnetic  field— must  be  associated  with  a  frame  of 
reference.  Newton  believed  that  his  laws  were  related  to  an  abso- 
lute frame  of  reference.  Even  in  his  writings,  however,  this  notion 
remained  purely  metaphysical,  and  Newton  himself  discovered  the 
principle  of  relativity  in  mechanics  according  to  which  it  is  im- 
possible to  detect  a  uniform  rectilinear  motion  of  a  body  (or  a 
frame  of  reference)  relative  to  this  absolute  system,  because  all 
frames  of  reference  moving  linearly  with  constant  relative  velocities 
with  respect  to  each  other  are  completely  equivalent. 

Consider  the  Galilean- Newtonian  transformation  from  one  in- 
ertial  system  to  another,  moving  along  the  x  axis  with  relative 
velocity  v 


2 
In  1932,  a  particle  with  a  positive  charge  and  mass  equal  to  that  of  an  electron,  known 

as  the  positron,  was  discovered  Positrons  are  formed  in  small  quantities  when  cosmic 
rays  pass  through  matter.  Under  ordinary  conditions  a  positron  cannot  exist  for  any  sig- 
nificant length  of  time  because  it  combines  with  an  electron  and  the  two  particles  are 
converted  into  gamma-ray  photons  (see  Chapters  3  and  22). 
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where  the  primed  coordinates  refer  to  the  moving  system,  and  the 
unprimed  coordinates  to  the  stationary  system.  We  find  that 
accelerations  and  forces  are  identical  in  the  two  frames  of  refer- 
ence, and  therefore  the  equations  of  mechanics  (in  which  the 
velocity  does  not  appear)  are  invariant  under  this  transformation. 
If  the  Galilean- Newtonian  transformation  is  applied  to  the  Maxwell- 
Lorentz  equations,  they  assume  different  forms  in  different  inertial 
systems,  because  the  equations  contain  the  velocity  of  propagation 
of  electromagnetic  waves  which,  added  vector ially,  has  different 
values  in  different  inertial  systems.  The  original  Michelson-Morley 
and  other  numerous  experiments  showed,  however,  that  the  speed  of 
light  is  the  same  in  any  direction  in  all  inertial  coordinate  systems. 
As  a  result,  Einstein  generalized  the  Newtonian  principle  of  rela- 
tivity in  a  way  that  led  directly  to  the  so-called  Lorentz  trans- 
formations 


*=  *=!! 


<— B£. 


P=4. 


The  classical  laws  of  electrodynamics  are  invariant  under  this 
transformation.  Since  the  equations  of  Newtonian  mechanics, 
however,  are  not  invariant  under  the  Lorentz  transformations, 
they  had  to  be  replaced  by  relativistic  equations  in  which  the 
mass  m  of  a  moving  particle  was  related  to  its  velocity  v  and 
its  rest  mass  m0by  the  relationship 


At  low  velocities,  where  P2^0,  the  relativistic  equations  reduce 
to  the  Newtonian  formulations. 

The  Maxwell- Lorentz  equations  for  an  electromagnetic  field 
and  the  relativistic  equations  of  motion  of  an  electron  constituted 
the  culminating  stage  of  the  classical  electron  theory.  According 
to  this  theory,  light  consists  of  electromagnetic  waves  and  an 
electron  is  a  particle  whose  motion  is  described  by  relativistic 
mechanics.  The  success  of  the  Maxwell- Lorentz  theory  in  account- 
ing for  certain  microscopic  phenomena  (the  propagation  and  dis- 
persion of  light,  the  motion  of  an  electron  in  electric  and  magnetic 
fields,  and  so  forth)  was  accompanied  by  the  discovery  of  exper- 
imental facts  that  could  not  be  explained  with  classical  concepts. 
These  experiments  will  be  described  in  Chapters  1—3,  and  there- 
fore we  shall  mention  them  here  only  very  briefly. 

In  the  first  place,  it  was  found  that  black-body  radiation,  the 
photoelectric  effect,  and  the  Compton  effect  could  be  explained  only 
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on  the  basis  of  corpuscular  properties  of  light.  This  was  the  im- 
plicit assumption  of  the  Planck- Einstein  photon  theory,  in  which 
the  discrete  structure  of  light  was  described  in  terms  of  Planck's 
constant  h  =  6.62*  10""27  erg- sec.  The  photon  theory  was  also  suc- 
cessfully used  by  Bohr  in  constructing  the  first  quantum  theory  of 
the  atom,  based  on  the  planetary  model  suggested  by  Rutherford. 
In  the  second  place,  a  number  of  experimental  facts,  including  the 
electron  diffraction,  indicated  that  in  addition  to  its  corpuscular 
properties,  an  electron  has  wave  properties.  De  Broglie's  def- 
inition of  the  wavelength  of  an  electron  also  included  Planck's  con- 
stant h.  This  led  eventually  to  the  development  of  a  new  science- 
electron  optics— which  provides  a  theoretical  basis  for  electron 
microscopy. 

The  SchrBdinger  wave  equation  (1926)  was  the  first  general 
theoretical  treatment  that  explained  both  of  these  classes  of  phe- 
nomena and  unified  the  preliminary  theories  of  Planck,  Einstein, 
Bohr,  and  de  Broglie.  This  equation  made  it  possible  to  discover 
the  laws  of  behavior  of  electrons  and  other  elementary  particles 
and  to  construct  a  relatively  systematic  theory  of  radiation  that 
took  into  account  the  quantum  nature  of  light.  For  atomic  physi- 
cists, the  Schr'ddinger  equation  was  one  of  the  most  powerful  tools. 
Many  phenomena  associated  with  the  behavior  of  an  electron  in  an 
atom  and  with  the  absorption  and  emission  of  light  by  an  atom  were 
provided  with  a  theoretical  explanation  (see  Chapters  4-14).  The 
later  development  of  quantum  theory  showed  that  the  Schrbdinger 
equation  did  not  describe  all  the  properties  of  atoms.  In  particu- 
lar, it  could  not  explain  correctly  the  interaction  of  an  atom  with  a 
magnetic  field  (for  instance,  the  anomalous  Zeeman  effect)  and  it 
could  not  be  used  to  construct  a  theory  of  multielectron  atoms. 
One  of  the  main  reasons  for  this  was  that  the  Schrbdinger  theory 
did  not  take  into  account  the  electron  spin. 

Dirac's  relativistic  theory  (see  Chapters  15-17)  was  an  exten- 
sion of  the  Schrbdinger  theory  that  considered  relativistic  and  spin 
effects  of  moving  electrons  (see  Chapters  18-20).  It  turned  out  that 
the  quantitative  corrections  due  to  relativistic  effects  were  rela- 
tively small,  but  that  spin  effects  were  of  fundamental  importance 
in  connection  with  the  fine  structure  of  multielectron  atoms  (see 
Chapters  23  and  24).  These  effects  explained  the  filling  of  electron 
shells  in  an  atom  and  gave  a  theoretical  basis  to  Mendeleyev's 
periodic  table  of  elements  (see  Chapters  25  and  26). 

Although  the  fundamental  problems  related  to  the  structure  of 
the  atom  were  basically  solved  with  the  appearance  of  Dirac's 
equation,  we  are  constantly  adding  further  details  to  our  knowl- 
edge. At  present  a  great  deal  of  attention  is  being  devoted  to  the 
influer^e  of  the  electron- positron  vacuum  and  magnetic  moments 
on  the  energy  levels  of  atoms  (see  Chapters  21  and  22).  Quantum 
mechanics  has  also  been  applied  to  simple  molecules  (Chapter  27), 
solid  state  physics  (Chapter  6),  and  the  atomic  nucleus. 


Part  I 


Nonrelativistic  Quantum  Mechanics 


Chapter  1 


The  Quantum  Theory  of  Light 


Before  the  supremacy  of  classical  physics  was  challenged  by 
the  advent  of  quantum  mechanics  in  the  beginning  of  this  century, 
particle  motion  was  sharply  distinguished  from  wave  motion. 
According  to  the  classical  picture,  the  world  consisted  of  particles 
(for  example,  electrons  and  ions)  and  fields  (for  example,  light). 
This  picture  was  completed  by  Maxwell's  theory  (1873),  which 
appeared  to  have  definitely  established  that  fields  had  wave-like 
properties. 

Towards  the  end  of  the  nineteenth  century  and  in  the  first  years 
of  this  century,  this  state  of  affairs  was  disturbed  by  the  discovery 
of  experimental  facts  that  did  not  fit  into  the  classical  conceptual 
framework.  On  the  one  hand,  there  were  certain  phenomena,  such 
as  the  radiation  spectrum  of  an  ideal  black  body,  the  photoelectric 
effect,  and  the  Compton  effect,  which  could  be  understood  only  in 
terms  of  particle-like  properties  of  light.  On  the  other  hand,  elec- 
trons were  observed  to  have  wave-like  properties,  such  as  diffrac- 
tion, which  later  served  as  a  basis  for  the  development  of  electron 
optics. 


A.  PRINCIPLES  OF  THE  ELECTRON  THEORY 

The  behavior  of  the  electromagnetic  field  produced  by  a  given 
distribution  of  charge  and  current  is  described  by  the  well-known 
Maxwell-Lorentz  equations 


c    dt 


V  -£  =  4wp, 


where  E  and  //are  the  electric  and  magnetic  field  intensities,  re- 
spectively, p  is  the  charge  density  (for  example,  of  the  electron), 
and  v  is  its  velocity. 
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To  start  with,  an  electromagnetic  field  transmits  interactions 
between  the  charges.  The  interaction  between  stationary  charges 
e  is  transmitted  by  an  electrostatic  field  which  satisfies  Coulomb's 

law    (v  =?*—-],    whereas   the  interaction  between  moving  charges 


is  transmitted  by  a  system  of  electric  and  magnetic  fields,  since 
a  moving  charge  can  be  regarded  as  an  electric  current  and  it  is 
well  known  that  a  current  interacts  with  a  magnetic  field.  Elec- 
tromagnetic fields  are  always  associated  with  sources  of  the 
appropriate  type  (for  example,  charges). 

Secondly,  electromagnetic  fields  may  be  regarded  as  electro- 
magnetic waves,  which  propagate  with  the  velocity  of  light  c  (radio 
or  light  waves).  As  a  particular  example  of  a  source  of  light  waves, 
we  can  take  an  accelerated  charge.  The  latter  emits  radiant  energy; 
per  unit  time,  this  energy  is 

W—  2  e*w*  (I  21 

w  —  "3~~7~»  (•*••*/ 

where  e  is  the  charge,  and  w  its  acceleration.  Once  electromagnetic 
waves  have  been  produced,  they  can  exist  independently  of  their 
sources. 

The  equation  describing  the  propagation  of  a  light  wave  is 
obtained  from  Eqs.  (1.1)  by  setting  the  charge  density  p  equal  to 
zero.  We  can  then  eliminate  the  vector  H  from  the  second  Maxwell 
equation  by  taking  the  curl  of  this  equation  and  substituting  into  it 
the  first  equation.  Since  V  •£--=(),  we  can  use  the  vector  relation 


VxVx£=:   V(V-£)—  V2£=— 

to  obtain  the  following  wave  equation,  which  holds  for  the  compo- 
nents of  both  vectors  E  and  //: 


where  /  is  any  component  of  the  vectors  £  or//. 

A  more  detailed  analysis  of  the  Maxwell-  Lorentz  equations 
shows  that  electromagnetic  waves  are  transverse.  This  means  that 
the  electric  field  intensity  (E)  and  magnetic  field  intensity  (//)  are 
mutually  orthogonal,  and  also  orthogonal  to  the  wave  vector  k, 
which  points  in  the  direction  of  propagation  of  the  electromagnetic 
wave.  The  vectors  form  a  triad  such  that  when  a  right-handed 
screw  is  turned  from  E  to  //,  it  moves  along  the  direction  of  k  : 

//=*°x£,  (1.4) 

where  W  —  k/k  is  a  unit  vector. 
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A  charge  (say,  an  electron)  moving  in  externally  applied  electric 
and  magnetic  fields  experiences  a  force 

l*x#),  (1.5) 

which  is  called  the  Lorentz  force. 

Taking   into   account  the   relativistic   variation  of  mass,   the 
equation  of  motion  of  an  electron  in  an  external  field  has  the  form 


where 


We  can  select  a  Lagrangian  function  X  in  such  a  manner  that 
the  variational  principle 


0,  (1.7) 

or,  in  more  explicit  form, 

Jffsf-CjH  ('='.*>  3),  d.8) 

will  yield  the  equation  of  motion  of  an  electron  (1.6).  Here  xl=x9 
x*s=y*  x*  =  z  are  the  spatial  coordinates,  and  the  jf£  denote  the 
corresponding  velocities.  To  obtain  (1.6),  we  must  set 


*->t,  (1.9) 

where  4  and  o  are  the  vector  and  scalar  potentials  of  the  electro- 
magnetic field.  These  potentials  are  related  by  the  Lorentz  condi- 
tion 

v.^+j|5=o.  (i.io) 

The  electric  and  magnetic  field  intensities  can  be  expressed  in 
terms  of  A  and  O  by  means  of  the  relations 


, 
0  dt  (1.11) 

//=vxA 

We  find  the  following  expression  for  the  electron  momentum: 
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Similarly,   for  the  generalized  force  acting  on  the  electron,  we 
obtain 


a--  -£+*(• 


Substituting  (1.12)  and  (1.13)  into  (1.8)  and  taking  into  account  (1.10) 
and  (1.11),  we  obtain  Eq.  (1.6)  for  the  motion  of  an  electron.  Thus, 
our  choice  of  the  Lagrangian  is  justified. 

Since  we  know  X,  we  may  also  determine  the  Hamiltonian  //: 


=    PA-* 


It  is  well  known  that  the  Hamiltonian  should  not  be  expressed  in 
terms  of  the  velocity  c$t  but  in  terms  of  the  generalized  momentum 

p=p  —  —  A,  which,   according   to    (1.12),   is   related  to  cp    by  the 

c 

equation 


Therefore,  the  relativistic  form  of  the  Hamiltonian  is 


//==  /c9/>a  +  mfc4  +  *$.  (1.14) 


We  note  that  if  the  potentials  are  time- independent ,  the  Hamiltonian 
is  equal  to  the  total  energy  ( E  =  H ). 

In  the  nonrelativistic  approximation  (P<m0c),  Eq.  (1.14)  can  be 
written  in  the  form 


where  H'  is  approximately  equal  to  the  nonrelativistic  part  of  the 
Hamiltonian 


<W5) 


From  this  it  can  be  seen  that  the  relativistic  equation  for  the 
Hamiltonian  (1.14)  also  includes  the  rest-mass  energy  m0c9  of  the 
electron.  It  is  very  important  to  take  this  rest-mass  energy  into 
account  in  studying  transformations  of  elementary  particles. 
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B.    THE  CLASSICAL  THEORY  OF  BLACK-BODY  RADIATION 

Among  all  the  phenomena  associated  with  electromagnetic  fields, 
special  importance  can  be  attached  to  the  properties  of  cavity  radia- 
tion. This  can  be  described  as  thejradiation  inside  a_cavi.ty.PQjaa- 

wMla^which  areieated  to  a  certain 


alternatively,  as  radtetipnl]^ 

^  A  small  hole  madejn  the  wall 


In  otherwoiggj  essentiallyjUl  rays  entejj^ 
Hole-Iarela6sorbed.  or.  moi^_^^cisely^  the 


reemerga^from  the_hole  is  negligibly  small.  Consequently*.  the 
cavity  radiation  escaplnglfrom  the  hole_may..Jaft  regarded  ftff  the 
radiation  which  would  be  obtained  Jzomjui  ideal  black^body,  jtnd  it 
is  generally  referred  to^sj5Iack-body  radiation. 

The  analysis  of  black-body  radiation  played  a  particularly  im- 
portant role  in  the  foundation  of  quantum  theory.  Although  a  more 
or  less  reasonable  classical  explanation  could  be  found  for  many 
other  phenomena,  every  single  theory  of  black-body  radiation  con- 
structed on  the  basis  of  classical  concepts  failed  to  agree  with  the 
experimental  facts.  A  systematic  theory  of  black-body  radiation 
was  developed  only  in  the  beginning  of  this  century,  when  Planck 
introduced  the  concept  of  a  quantum  of  energy.  This  concept 
was  later  to  play  an  important  role  in  the  development  of  the  first 
quantum  theory  of  the  atom,  and  afterwards  in  the  development  of 
quantum  mechanics. 

We  shall  now  consider  the  theory  of  black-body  radiation,  con- 
fining ourselves  for  the  time  being  to  classical  concepts.  Let  the 
radiation  be  characterized  by  its  spectral  density  Po,,1  which  is 
related  to  the  ordinary  electromagnetic  energy  density 


<1-16> 

by  the  relation 

P-  =  ZT.  (1-17) 


In   the   literature,   the  function  pv  is  sometimes  used  for  the  spectral  density.     The 
function  p^  is  related  to  pv  by  the  equation 


1 
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since  o>  =  2/m 
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where  du    is  the  energy  density  of  the  radiation  in  the  frequency 
range  from  o>  to  o>  +  d(o.    Obviously, 


(1.18) 


On  the  basis  of  the  second  law  of  thermodynamics,  Kirchhoff 
showed  that  the  density  pw  is  determined  only  by  the  temperature 
of  the  walls  of  the  closed  cavity  and  is  entirely  independent  of  the 
material  of  which  the  walls  are  made;  that  is,  p(1)=/(o),  T\ 

Consequently,  the  walls  of  the  cavity  may  be  considered  as  a  set  of  oscillators.  The 
average  energy  of  these  oscillators  is  completely  determined  by  the  spectral  density  of 
the  black-body  radiation.  We  shall  show  this  starling  with  the  equation  of  motion  for  an 
oscillator  and  taking  into  account  Planck's  radiation  damping 

=£IJ-  <U9> 

Here  e  and  m0  are  the  charge  and  rest  mass  of  the  oscillator,  o>  is  its  natural  frequency 
of  oscillations,  and  Ex  is  the  x  component  of  the  electric  field  intensity  of  the  black- 
body  radiation. 

Representing  Ex  in  the  form  of  a  Fourier  series 


=    2 


(1.20) 

where  Exn  is  the  amplitude  of  an  individual  oscillation  of  the  field  with  frequency 

<*«  =  no>o,  (Ul) 

we  obtain  the  following  equation  for  x(t)  from  Eq.  (1.19): 


„  99X 


The  average  energy  of  the  oscillator,  which,  according  to  the  virial  theorem,  is  twice 
the  average  kinetic  energy,  is  given  by 


(1.23) 


where  the  bar  denotes  averaging  over  time.  Since 


*  *      gtoot  (n-n') 
n,  n' *** — oo 
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where  f_n  =/f ,  and  since,  moreover, 


*>  =  !     f 

*     J 


2 

where  T  =  27i/o)0,  Eq.  (1.23)  can  be  reduced  to  the  form 


This  equation  has  a  very  sharp  maximum  in  the  neighborhood  of  frequency  o>,and 
hence  the  total  energy  of  the  oscillator  will  actually  depend  only  on  those  terms  of  the 
series  for  which  mo0  «s  u.  Consequently,  in  the  above  equation,  the  square  modulus  of  the 

amplitude  |  Exn  |2  can  be  replaced  by|  ExriQ  |,  where  /*0  =  —  ,  and  at  the  same  time  the  sum 
can  be  changed  to  an  integral.  According  to  (1.2 1),2 


d<*n  =  <o0drt  =  <o0  =  — ,  ( US) 


since  dn  =  1.  Therefore,  we  obtain 

fj.0%  I     E   y.n 

n- o     "OP         i     JT/IQ 


Replacing  the  frequency  wrt  by  <«>  everywhere  except  in  the  difference  o>n  —  w,  introduc- 
ing   the  variable   of  integration  S  =  wrt  —  <o,  and  extending   the  limits  of  integration  to 
±  oo,  we  find 


_37t/Z0Ca 

—  "2  ~a>r     '    ^o  '  ' 


(U7) 


On  the  other  hand,  the  energy  density  u,  which  is  related  to  the  electromagnetic  field 
of  the  radiation  by  Eq.  (JL 16),  can  also  be  expressed  in  terms  of  |  ExriQ  |*.  Since  the  radia- 
tion is  isotropic  we  have,  on  the  basis  of  Eq.  (1.16), 

1     71 


The  reader  should  not  feel  uneasy  about  our  equating  the  differential  dcon  to  the  finite 
quantity  O)0  =  277/r,  for  it  is  always  possible  to  make  the  period  T  sufficiently  large  so  that 
the  relationship  O)Q  <3C  a)  will  be  satisfied.  Mathematically,  this  corresponds  to  a  transi- 
tion from  Fourier  series  to  Fourier  integral. 
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Using  the  expansion  (1.20)  and  the  rule  (1.24)  for  averaging  over  1 9  we  obtain 


Hence,  taking  into  account  Eq.  (1.18)  together  with  the  relationship 


for«,t  =  o>  (n  =  /20)f  we  obtain 


Comparing  Eqs.  (1.31)  andjl.27),  we  find  the  relationship  between 
the  average  energy  £=;%£*  of  the  oscillator  and  the  spectral 
density  of  the  radiation  p^ 


d.32) 


which  forms  the  basis  of  the  theory  of  black-body  radiation. 

In    classical    statistical    physics,   the   energy   distribution  of 
particles  is  given  by  the  function 

N(E)  =  Ae-**.  (1.33) 

where  a=  1/£T;  k=  1.38  -  10~16  erg.deg'1  is  Boltzmann's  constant, 
and  T  is  the  temperature  of  the  medium.  The  average  energy  of  the 
particles  is 


(1.34) 


Substituting  this  value  of  E  into  Eq.  (1.32),  we  obtain  the  Rayleigh- 
Jeans  formula 

9«=-4r-kT.  (1.35) 
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This  equation  satisfies  the  Wien's  thermodynamic  law 


(1.36) 


which  was  based  on  various  results  in  thermodynamics  and  the 
electromagnetic  theory  of  light.  In  the  region  of  long  wavelengths 
(low  frequencies),  the  Ray leigh- Jeans  formula  is  in  good  agree- 
ment with  experimental  data.  At  short  wavelengths,  however,  it 
completely  fails  to  agree  with  experiment  (see  Fig.  1.1). 


Fig.    1.1.  Radiation  spectrum  of  an  ideal  black  body. 

The  heavy   dotted   line   indicates   the   Ray  leigh- Jeans 

2 
curve  pa  ~  pQX  ,  and  the  solid  line  the  Planck  curve 

Pa  ~pox  /(e*  ~  1)»  which  is  the  same  as  the  experi- 
mental curve.    Here  p$  —  (&T)  /n  h  c   ,  0)  =  &)QX,  and 
0)0  =  kT/H. 

In  exactly  the  same  way,  the  use  of  the  Ray  leigh- Jeans  formula 
for  calculation  of  the  radiation  energy  density  [see  Eq.  (1.18)]  re- 
sults in  a  divergent  integral,  that  is,  we  obtain  the  obviously  absurd 
relationship 


S1  bT(* 

P«d»=-JiJT  J 
0  0 


>  =  oo. 


(1.37) 


This  was  called  the  "ultraviolet  catastrophe"  by  Ehrenfest.  Thus, 
the  classical  theory  was  completely  unable  to  give  a  satisfactory 
description  of  black-body  radiation. 


C.    PLANCK'S  EQUATION 

In  1900,  Planck  put  forward  an  important  hypothesis  which  re- 
moved the  ultraviolet  catastrophe  and  radically  changed  a  number 
of  fundamental  principles  of  classical  physics.  According  to  this 
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hypothesis,  the  energy  of  microscopic  systems  (atoms,  molecules, 
and  so  forth)  does  not  vary  continuously  and  assumes  only  certain 
specific  discrete  values.  In  particular,  the  energy  of  a  harmonic 
oscillator  must  be  a  multiple  of  a  certain  minimum  value  e: 

£  =  /ie,  (1.38) 

where  AZ  =  O,  1,  2  .... 

In  order  to  determine  the  average  value  of  the  energy,  we  must 
replace  the  integral  (1.34)  by  the  sum 


d.39) 


Substituting  this  value  of  E  into  Eq.  (1.32),  we  obtain  the  spectral 
density  of  the  radiation 


coa  s 

"rf?"  ~        "  (1.40) 

ek1  —  \ 


We  can  bring  this  equation  into  agreement  with  Wien's  thermo- 
dynamic  law  by  letting  £  be  proportional  to  u> : 

e  =  *o.  (1.41) 

We  then  obtain  Planck's  equation 

fiu* 
Pu)== S« '  (1.42) 

n«<j«(**r  — 1) 

which  was  a  brilliant  achievement  of  quantum  theory. 

The  quantity  ft  =  1.05  •  10~*7erg.sec,  which  has  the  dimensions  of 
action,  is  called  Planck's  constant.3 

At  low  frequencies  f^<  l),  the  exponential  e*to/kT  may  be  expanded 

in  the  form  of  a  power  series  in  ft«>/W.  Restricting  ouselves  to  the 
linear  terms  of  the  expansion,  we  have 


3In   the  literature,  Planck's   constant  is  more  often  taken  as  the  quantity  h  = 
6.6249  •  10  27  erg  •  sec,  which  relates  the  energy  €  to  the  frequency  l/: 

(  =  hv . 
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Thus,    Planck's    equation    (1.42)   reduces   to   the    Rayleigh-Jeans 
formula  (1.35). 

In  the  case  of  high  frequencies  (jy!>  1),  we  may  neglect  the  1  in 
the  denominator  of  Eq.  (1.42)  and  write  pw  in  the  form 

P.  » _  a — fl<Q/hT  /I     /I  Q  \ 

CD —    rcV  *  V1-^"/ 

Planck's  equation  (1.42),  which  describes  the  dependence  of  the 
spectral  density  p^  of  thermal  radiation  on  the  frequency  u>t  is  in 
excellent  agreement  with  experiment  (see  Fig.  1.1). 

From  Eqs.  (1.42)  and  (1.18),  we  can  find  the  total  radiation 
density 

OO  00 

«  =  J^-nM-3££rr-  <M4) 

0  0 

Introducing  the  variable  l  =  fi<o/kT  and  considering  that 


we  obtain  the  well-known  Stefan-Boltzmann  law4 


where 

=  7.56-  10-"  erg.cm-3.deg-4.  (1.46) 


From  Eq.  (1.42),  it  can  be  seen  that  the  spectral  density  of 
black-body  radiation  has  a  maximum  at  some  value  of  co  and  that 
the  position  of  this  maximum  changes  with  temperature.  The 
equation  governing  the  position  of  this  ma^pum  is  called  Wien's 
displacement  law.  More  often,  Wien's  displacement  law  is  expressed 


Usually  it  is  not  the  density  u  which  is  measured,  but  the  energy  oT   which  is  radiated 
per  second  per  square  centimeter  of  the  black  body's   surface  within  a  solid  angle  of  2lT 

fl  -  277).    In  this  case,  the  Stcfan-Boltzmann  constant  is  a  -  277  —  a  =  —  a  =  5.67  •  10~5 

877          4 

2  4  i 

erg  •  cm       •  deg      •  sec 
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in  terms  of  the  spectral  distribution  with  respect  to  the  wavelengths 
X  „  To  determine  pA,  we  can  use  the  expression  for  u 


00 

..;, 


Since    X  —  2^c/a),  transforming  to  the  spectral  distribution  over  the 
frequencies,  we  have 


from  which  we  find 

_    2itc         _       \§K*cU 

Px  —  —  x»—  P»—        2^5        •  (1.48) 

X*(«*n  —  1) 

To  determine  the  wavelength   Xmax  at  which  px  has  its  maximum, 

we  set  $-x  =  0: 

CM 


-5-h- 


Setting  <2KcH/kTXmax  =  y,  we  obtain  the  equation 

y  =  5(l-£rv)f 

whose  solution  can  be  given  with  good  accuracy  in  the  form 

i/^5(l—  <r8)  =  4.965. 
Thus   ^max  is  related^  the  temperature  7  by  the  equation 

=3i  =  *  =  0.29  cm-deg,  (1.49) 


which  expresses  Wien's  displacement  law,  and  where  b  is  the 
Wien's  constant.  According  to  this  law,  as  the  temperature  of  an 
ideal  black  body  increases,  the  maximum  of  the  radiation  intensity 
is  shifted  towards  shorter  wavelengths  (see  Fig.  1.2). 
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Equations  (1.46)  and  (1.49)  relate  Planck's  constant  H  and  Boltz- 
mann's  constant  k  to  the  constants  a  and  b. 

Knowing  the  numerical  values  of  a  and  b ,  we  can  determine 
ft  and  k.  This  is  the  way  in  which  a  numerical  value  was  first 
obtained  for  h  and  a  better  value  found  for  k. 


1.44  192 


Fig.   1.2.  Wien's  displacement  law 

Curves  of  the  spectral  energy  distribution  as 

a  function  of  temperature  for  an  ideal  black 

body:  AmaxT  ^  0  29  cm  .deg 


Recapitulating,  it  follows  from  Planck's  hypothesis  that  proc- 
esses such  as  emission  and  absorption  involve  discrete  quanta.  In 
other  words,  the  eivergy  change  of  particles  involved  in  these  proc- 
esses is  discontinuous  and  not  smooth  as  would  follow  from  the 
laws  of  classical  physics. 


D.    EINSTEIN'S  PHOTON  THEORY 

In  deriving  his  equation,  Planck  assumed  that  the  energy  of  the 
oscillators  is  quantized.  The  original  version  of  the  theory,  however, 
does  not  provide  any  physical  justification  for  this  property.  Indeed, 
Planck  himself  chose  to  attribute  the  "special  properties"  to  the 
heated  body  rather  than  to  the  electromagnetic  radiation. 

The  second  important  step  towards  the  jg^elopment  of  quantum 
theory  consisted  of  Einstein's  hypothesis  mat  oscillators  absorb 
and  emit  radiation  in  discrete  amounts  because  electromagnetic 
radiation  itself  consists  of  discrete  particles,  called  photons,  which 
carry  an  amount  of  energy  ftu>.  In  effect,  Einstein  interpreted 
Planck's  equation  as  a  description  of  the  corpuscular  properties 
of  light. 

We  shall  now  attempt  to  develop  an  elementary  theory  of 
photons. 
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According  to  classical  theory,  the  energy  of  a  light  wave  is 

e  =  -^  J  (£'  +  //2)  <Px  =  I-  J  E2  <Px,  (1.50) 

where  d*x  is  an  element  of  volume,  and  the  integration  extends  over 
all  space.  The  electromagnetic  momentum  ji  of  the  light  wave  in 
classical  theory  is 


According  to  Eq.  (1.4),  this  can  also  be  written  in  the  form 


Comparing    (l.Sla)   and    (1.50),   we   find  the  relation  between  the 
momentum  at  and  the  energy  e  : 

*  =  *°CE.  (1.52) 

In  the  theory  of  relativity,  a  similar  relation  between  energy 
and  momentum  holds  for  particles  with  zero  rest  mass  and  it  can 
easily  be  obtained  from  Eq.  (1.14)  by  substituting  mn—-Qt  <P  =  0 
and  4  =  0. 

From  these  considerations,  Einstein  concluded  that  an  electro- 
magnetic field  can  be  considered  as  a  set  of  particles  called  photons, 
with  zero  rest  mass  and  the  energy 

6  =  »w.  (1.53) 

For  the  photon  momentum  the  following  equation  is  obtained: 

n  =  #*2  =  #*-  =  nk,  (1.54) 


where  h  =  2rcft  ,  and  £  =  -—  is  the  wave  vector  k  =  --  is  the  wave 


number). 

On  the  basis  of  these  concepts,  in  1905,  Einstein  constructed  a 
quantitative  theory  of  the  photoelectric  effect,  which  had  been  dis- 
covered by  Hertz  in  1887.  What  is  observed  in  the  photoelectric 
effect  is  the  following:  the  potential  difference  required  for  a 
spark  to  jump  between  two  small  charged  spheres  is  reduced  if  the 
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cathode    is    illuminated.    To   explain   this   phenomenon,  Einstein 
postulated  the  simple  equation 

Mo#3         -a.          iw  /I    c;c;\ 

— — ~-  j^to — .  U/.  ^A.tJcJ/ 


This  is  essentially  the  law  of  conservation  of  energy  and  indicates 
that  the  kinetic  energy  —  !~  of  the  ejected  electron  is  equal  to  the 

difference  between  the  energy  of  the  absorbed  photon  Hw  and  the 
work  function  W  of  an  electron  in  the  metal.  It  is  obvious  that  if 
tiux^W,  electrons  cannot  be  ejected  from  the  metal.  Only  if  the 
energy  of  the  incident  photons  exceeds  W  can  electrons  leave  the 
metal. 

The  experimental  verification  of  Einstein's  theory  of  the  photo- 
electric effect  provided  striking  confirmation  of  his  basic  conclusion 
that  the  energy  of  the  ejected  electrons  depends  only  on  the  fre- 
quency of  the  incident  light  and  not  on  its  intensity,  and  that  the 
emission  of  photoelectrons  begins  only  when  the  frequency  of 
light  a)  exceeds  a  certain  limit  (so-called  threshold  frequency) 


The_implications   of   the   photon  theory  were  brought  out  and 
verified   in    1923   by   experiments   on  the  scattering  of  x-rays  by 
free  electrons  (ite  Compton  effect). 
jrEe  Compton  effect  was,  particularly 
"interesting "because  it  .confirmed  not 
pnly  the  law  of  c observation  of  energy 
(which  was  already  verified   by  the 
photoelectric  effect)  but  also  the  law 
of  conservation  of  momentum.  "     Flg" }  -3;  Scat^rmrg  of  ;ight  fbfy  '  free 

.  .  .  electron  (the  Compton  effect) 

It  is  well  known  that,  in  classical 
theory,  the  frequency  of  light  does  not 

change  when  it  is  scattered  by  a  free  electron  (u/  =  w).  By  contrast, 
in  quantum  theory,  part  of  the  photon's  energy  e  =  #u>  is  transferred 
to  the  electron  (see  Fig.  1.3).  Consequently,  the  energy  and 
frequency  of  a  scattered  photon  should  generally  be  somewhat 
smaller  (e'<e,  u/O) .  To  find  the  dependence  of  frequency  on  the 
scattering  angle,  let  us  write  the  laws  of  conservation  of  energy 
and  momentum,  treating  the  photons  as  particles: 

fito  —  ftu/  =  cl  (tn  —  Wo)» 


Here   m0    and  w  =  m0/)/l  —  p*  represent   the   mass    of  the  electron 
before  and  after  collision;  v  is  its  velocity;  p  =  w/c;  Hk  =  n^/c  and 
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tik'  =  fno'/c  represent  the  momentum  of  the  photon  before  and  after 
scattering*  We  rewrite  Eq.  (1.56)  in  the  form 


c2 

u)  —  u/  =  j- -  (m  —  m0) , 
k  —  K  —  — 

R        K—          . 


(1.56a) 


Taking   the   square   of  these   equations    and   subtracting  the  first 
equation  from  the  second,  we  obtain 

too/  (1  —  cos  0)  =  !?£-  (ceo  —ceo').  (1.57) 

Substituting  X  =  2rcc/u),   X'  =  2irc/«/,  and  dividing  (1.57)  by  <DO/,  we  find 
an  expression  for  the  increase  in  wavelength  of  the  scattered  light 

AX  =  X'  —  X  =  2X0sina!-,  (1.58) 

where  X0  is  the  Compton  wavelength  of  the  electron 


cm. 


We  therefore  see  that,  according  to  quantum  concepts,  the  wave- 
length of  the  scattered  light  X'  must  be  greater  than  the  initial  wave- 
length X(A/>X)  since  w'O.  This  difference  increases  with  the 
scattering  angle  0.  Since  the  Compton  wavelength  X0  is  relatively 
small,  Compton  scattering  is  observed  at  relatively  short  wave- 
lengths (  x-rays  and  gamma  rays  ).  Indeed,  for  visible  light 
(X~10  8  cm) 


whereas  for  x-rays  (X~  10  8—  10~9  cm) 


Therefore,  the  Compton  shift  can  be  observed  experimentally  only 
in  the  second  case. 

In  his  experiments,  Compton  studied  the  scattering  of  radiation 
from  an  x-ray  tube  by  graphite  and  other  substances  (lithium, 
beryllium,  sodium,  potassium,  iron,  nickel,  copper,  and  so  on)  at 
different  angles  ft.  The  spectral  distribution  of  the  intensity  of  the 
scattered  radiation  at  different  scattering  angles  was  measured  by 
means  of  an  ionization  chamber. 
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Figure  1.4  shows  the  spectral  distribution  of  incident  and 
scattered  waves.  If  the  incident  wave  (upper  curve)  has  one  maxi- 
mum, the  scattered  wave  (lower  curve)  will  have,  in  addition  to 
this  maximum,  a  second  maximum  at  a  longer  wavelength.  The 


Incident 
wave 


X 

Wavelength 


Scattered 
wave 


Wavelength 

Kig.    1   1-.    Spectral   distribution   of  x-rays  in 

the     Compton    effect     before    (upper    curve) 

and  after  (lower  curve)  scattering 


distance  between  the  wavelengths  of  the  two  maxima  must  corre- 
spond to  the  Compton  shift;  this  is  because  the  distance  increases 
with  the  scattering  angle,  and,  in  addition,  it  does  not  depend  on 
the  type  of  scattering  material  [both  these  facts  are  in  accord 
with  Eq.  (1.58)5].  The  unshifted  maximum  corresponds  to  scatter- 
ing by  electrons  which  are  strongly  bound  to  the  nucleus  (or 
more  precisely,  electrons  whose  binding  energy  is  greater  than 
the  energy  of  the  x-ray  quanta).  The  shifted  maximum  corresponds 
to  scattering  by  electrons  which  are  so  weakly  bound  to  the  nucleus 
that,  in  practice,  they  can  be  regarded  as  free. 

Thus  the  results  of  Compton' s  experiments  completely  confirm 
the  quantum  nature  of  light  (that  is,  the  photon  theory). 


Only  the  intensity  of  the  maxima  depends  on  the  type  of  scattering  substance.  As  the 
atomic  weight  of  the  scattering  substance  increases,  the  intensity  of  the  unshifted  maxi- 
mum increases,  and  that  of  the  shifted  maximum  decreases. 


Chapter  2 

The  Bohr  Quantum  Theory 

A.    BASIC   INFORMATION  ON  PROPERTIES  OF  ATOMS 

A  theory  of  the  atom  was  developed  only  after  reliable  experi- 
mental data  had  been  obtained  from  studies  of  the  effects  described 
below* 

1)  Emission  of  light  by  atoms.  From  careful  studies  of  the 
radiation  of  atoms,  it  was  established  that  they  have  bright-line 
spectra  and  that  the  lines  are  arranged  in  certain  definite  series. 
For  example,  all  the  lines  of  hydrogen  are  described  by  Balmer's 
formula1 


where  R  is  the  Rydberg  constant,  and  n'and  n  are  integers.  Setting 
n'  =  1  and  n  —  2,  3,  4, . . . ,  we  obtain  the  Lyman  series,  which  lies  in 
the  ultraviolet  part  of  the  spectrum.  For  n'—  2  and  n  =  3,  4,  5,  ..., 
we  have  the  Balmer  series,  which  is  located  in  the  visible  part  of 
the  spectrum  and  is,  therefore,  easiest  to  study. 

Formula  (2.1)  can  also  be  written  in  the  form  of  a  difference 
between  two  quantities 


In  spcctroscopy,  it  is  customary  to  write  Balmcr's  formula  in  the  form 


where  the  Rydberg  constant  for  hydrogen  is  Kqp  109,677.  h  cm"1.  The  value  of  the  Rydberg 
constant  in  Eq  (2  1)  is  related  to  Khp  by  the  equation 

K          2nvtfsp          2/7-  3  29  •   1015  sec    l     -    20  66  •  10KS  sec    l 

Equation  (2.1)  is  not  too  convenient  for  spectroscopial  use,  since  it  is  usually  the  wave- 
lengths which  are  experimentally  determined.  In  developing  a  theory,  we  do  not  need  to 
take  this  reservation  into  account,  since  we  can  always  transform  from  the  frequency  oj  to 
the  wavelength  A 

-    -    J? 
A         277  c 
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which  are  called  spectral  terms.  For  the  hydrogen  atom,  these 
terms  are  given  by 

T  — ~ 

n  —  n2  * 

This  possibility  of  representing  the  radiation  frequencies  «  as  a 
difference  between  two  terms  is  a  consequence  of  the  Ritz  combina- 
tion principle,  which  has  important  spectroscopic  applications  in 
regard  to  the  hydrogen  atom,  as  well  as  more  complex  atoms.  For 
example,  hydrogen  was  initially  found  to  have  two  series,  corre- 
sponding to  /i'=l  (the  Lyman  series)  and  to  n'  =  2  (the  Balmer 
series).  On  the  basis  of  the  Ritz  combination  principle,2  a  third 
series  was  predicted  with  /i'  =  3  and  /i  =  4,  5,  6,,...  This  series  was 
later  discovered  by  Paschen  in  the  infrared  region  of  the  spectrum. 

2)  The  behavior  of  an  atom  in  external  electric  and  magnetic 
fields  and,  in  particular,  the  interaction  of  the  atoms  of  a  substance 
with  fast  particles  passing  through  it.  The  most  important  experi- 
ments in  this  area  were  conducted  by  Rutherford,  who  succeeded 
in  finding  the  distribution  of  positive  charges  inside  the  atom  from 
the  analysis  of  fast-a -particle  scattering. 

3)  Finally,   investigation  of  various   properties  of  molecules 
provided  important  data  pertaining  to  the  properties  of  atoms.  For 
example,    the   formation   of  simple  homopolar  molecules  and  the 
valence   theory   found   their   explanation   only   on  the  basis  of  the 
modern  quantum  theory  of  the  atom. 


B.  THE  CLASSICAL  MODEL  OF  THE  ATOM 

Once  it  had  been  established  that  an  atom  consists  of  a  positively 
charged  part  associated  with  most  of  the  mass,  and  of  light, 
negatively  charged  electrons,  attempts  were  made  to  construct  a 
static  model.  The  reason  this  approach  to  the  problem  was  adopted 
is  that,  in  classical  electrodynamics,  an  accelerated  electron  emits 
radiation,  the  amount  of  energy  emitted  per  unit  time  being 

*— =".  (2-2) 


The  Ritz  combination  principle  was  first  formulated  a?  follows  if  there  are  two 
different  frequencies  belonging  to  the  same  series,  the  difference  between  these  fre- 
quencies is  also  a  frequency  which  can  also  be  emitted  by  the  atom,  but  belongs  to  another 
series.  The  concept  of  "terms",  permits  a  relatively  simple  explanation  of  this.  Indeed 


Hence 


and  thus  the  Ritz  combination  principle  leads  directly  to  Eq    (2  la) 
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where  e  =  —  e0    is   the   electron  charge  (e0  =  4,80  •  10"i0esu  is  the 
elementary  charge),  ay  is  the  acceleration  of  the  electron,  and  c  is 

the  velocity  of  light  in  vacuum.  The  minus 
sign  in  front  of  dE/dt  shows  that  the  energy 
of  the  electron  decreases  as  a  result  of 
the  emission  of  radiant  energy.  Since  an 
atom  does  not  radiate  in  the  ground  state, 
it  follows  from  the  classical  theory  that 
the  charges  in  the  atom  should  be  at  rest. 
The  most  interesting  classical  model 
was  that  of  Thomson,  according  to  which 
the  positive  charge  uniformly  filled  the 
entire  atomic  volume,  and  the  electronic, 
that  is,  negative  point  charges  were  lo- 
cated inside  the  atom. 

For  example,  in  the  hydrogen  atom,  the 
positive  charge  was  supposed  to  fill  uni- 
formly a  sphere  of  radius  /?0  (see  Fig. 
2.1).  The  charge  density  inside  the  sphere 
was  (for  Z=  1) 


Fig.  2.1.  Thomson's  model  of 
the  hydrogen  atom  (7,  -  1) 
The  positive  charge  ZrQ  is 
uniformly  distributed  over 
the  volume  of  a  sphere  of 
radius  K0  The  electron  (with 
charge  -t'0)  is  located  at  a 
distance  x  from  thf  center  of 
the  atom 


3*0 


In  the  ground  state,  the  electron  was  supposed  to  be  located  at  the 
center  of  the  sphere,  where  the  electric  field  is  zero.  At  a  distance 
r  =  .v<]?0  from  the  center,  the  electric  field  E  is  directed  along 
the  radius  and  its  intensity  can  be  found  from  Gauss f  law: 


Hence 


*°  r 
r' 


(2.3) 


Therefore,  if  an  electron  with  charge  e  =  —  eQ  and  mass  w0  is 
placed  at  a  distance  x  from  the  center  of  the  atom,  it  experiences 
a  quasi-elastic  attractive  force  towards  the  center 


With  this  force,  the  differential  equation  describing  the  motion  of 
the  electron  is 

j?-f.a)5A:  =  0,  (2.4) 

The  solution  of  this  equation  is 
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where 


Substituting  for  to0  the  fundamental  frequency  observed  in  the  Balmer 
series,  we  obtain  a  very  reasonable  value  for  the  radius  of  the 
atom,  namely,  RQ~IQ-*  cm.  This  value  is  many  times  greater 
than  the  classical  radius  of  the  electron 


The  Thomson  model  agreed  completely  with  the  classical 
Lorentz  theory,  according  to  which  atoms  can  be  represented  as 
harmonic  oscillators.  Unfortunately,  the  Thomson  model  could  not 
explain  the  regularities  of  the  line  spectra  of  atoms  and,  in  particu- 
lar, the  spectral  series  of  hydrogen  that  are  described  by  Eq.  (2.1). 
Indeed,  from  the  standpoint  of  classical  theory,  the  Thomson  model 
could  emit  radiant  energy  only  at  the  fundamental  frequency  u)0  or, 
at  best,  at  its  harmonics 

ci)/I  =  /iajtt,  (2.5) 

where  /i=l,  2,  3,... 

The  decisive  blow  to  the  Thomson  model  was  dealt  by  the 
experiments  of  Rutherford,  who  showed  that  the  positive  charge  is 
not  distributed  throughout  the  entire  volume  of  the  atom,  but  is 
concentrated  virtually  at  one  point.  Nevertheless,  the  Thomson 
potential  inside  a  nucleus  of  finite  dimensions,  with  a  charge  Ze03 
uniformly  distributed  through  the  volume,  is 


3  If  the  charge  of  the  nucleus  is  ZeQ,  the  electric  field  intensity  inside  the  nucleus  is 


=  r  =        __ 

RQ        ~        dr    r  ' 

from  which,  using  the  boundary  condition  that  at  r  ~  RQ  the  potential  is  the  same  as  for  a 
point  charge 


we  obtain  Eq.  (2  6). 
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does  play  an  important  role,  especially  when  corrections  for  the 
volume  of  the  nucleus  must  be  made.  Moreover,  when  mesic  atoms 
with  large  Z  are  formed  (in  a  mesic  atom,  an  electron  is  replaced 
by  a  negative  \JL  meson,  whose  mass  is  207  times  larger  than  the 
mass  of  the  electron),  there  may  be  states  in  which  the  negative  u 
meson  is  always  inside  the  nucleus.  In  this  case,  the  motion  of  the 
M  meson  will  be  determined  mainly  by  the  potential  (2.6).  The  ap- 
propriate equations  of  motion  will,  however,  be  quantum-mechanical, 
rather  than  classical  (see  Chapter  21). 


C.    RUTHERFORD'S  EXPERIMENTS  AND  CONCEPTS  OF 
ATOMIC  STRUCTURE 

Our  present  model  of  the  atom  is  based  on  the  famous  experi- 
ments conducted  by  Rutherford  in  1911  on  the  passage  of  alpha 
particles  through  matter.  It  is  well  known  that  alpha  particles, 
which  are  products  of  nuclear  disintegration,  possess  a  sufficient 
energy  to  penetrate  into  an  atom.  At  the  time  when  Rutherford 
conducted  his  experiments,  there  was  no  other  source  which  could 
produce  sufficiently  heavy  particles  (that  is,  particles  with  a  mass 
comparable  to  the  nucleus)  with  an  energy  great  enough  to  penetrate 
the  atom. 

By  passing  alpha  particles  through  thin  sheets  of  metal  (foil), 
Rutherford  showed4  that  most  of  the  alpha  particles  which  pass 
through  the  foil  are  scattered  through  relatively  small  angles  (2-3°) 
from  their  initial  direction  of  motion.  Within  the  framework  of 
the  Thomson  model,  these  small  deflections  could  be  explained  in 
terms  of  the  statistical  theory  of  random  processes,  because  of 
the  relatively  weak  interaction  between  the  atoms  and  the  alpha 
particles.  Rutherford  and  his  co-workers,  however,  also  detected 
individual  deflections  of  alpha  particles  through  very  large  angles 
of  up  to  180°.  Although  the  number  of  these  deflections  was  very 
small  (for  example,  when  a  beam  of  8000  primary  alpha  particles 
from  RaC  is  passed  through  platinum  foil,  at  most  one  particle  is 
deflected  through  an  angle  greater  than  90°),  it  was,  nevertheless, 
much  larger  than  the  number  which  could  be  predicted  on  the  basis 
of  superposition  of  a  large  number  of  small  random  deflections. 

Large  scattering  angles  were  also  observed  when  alpha  particles 
were  passed  through  a  gas.  These  could  be  easily  seen  in  photo- 
graphs taken  in  a  Wilson  cloud  chamber. 

From  a  general  analysis  of  his  experiments,  Rutherford  estab- 
lished, first,  that  atoms  are  fairly  transparent  to  alpha  particles 


In  earlier  experiments,  Rutherford  and  his  co-workers  had  established  that  alpha 
particles  have  the  same  mass  as  the  helium  atom,  and  a  positive  charge  which  is  twice 
the  magnitude  of  the  electron  charge  It  is  now  known  that  alpha  particles  are  the  nuclei 
of  helium  atoms 
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(that  is,  their  structure  is  relatively  "open");  and,  second,  that 
large  deflections  can  take  place  only  if  a  very  strong  electric  field 
exists  inside  the  atom.  This  electric  field  must  be  produced  by  a 
positive  charge  which  is  associated  with  a  large  mass  and  con- 
centrated in  a  very  small  volume.  We  note  in  this  connection  that, 
according  to  Eq.  (2.6),  the  largest  field  produced  by  a  nucleus  of 
radius  RQ  is 


To  explain  these  results,  Rutherford  proposed  a  planetary 
model  of  the  atom  in  which  the  structure  of  the  atom  resembles 
a  planetary  system.  A  positively  charged  nucleus  constituting 
almost  the  entire  mass  of  the  atom  is  concentrated  at  the  center 
in  a  very  small  volume  of  radius  /?0^  10"13  —  10"",  and  charged 
electrons  move  about  this  nucleus  in  closed  orbits  like  planets 
around  the  Sun.  We  note  that  the  potential  energy  of  the  Newtonian 
attraction  between  a  planet  of  mass  m  and  the  Sun  (of  mass  M) 

y  _  _  X/llAf 

Newt  ~>"> 

where  7.  is  the  gravitational  constant  and  has  the  same  form  as  the 
potential  energy  of  the  Coulomb  attraction  between  an  electron  and 
a  nucleus 


From  this  model,  Rutherford  developed  a  quantitative  theory  of 
scattering.  His  calculations  were  based  on  the  assumption  of  a 
Coulomb  interaction  between  the  alpha  particles  and  the  nucleus. 
The  influence  of  the  atomic  electrons  was  neglected  in  the  first 
approximation,  since  their  energy  is  considerably  lower  than  the 
energy  of  the  bombarding  particles. 

Let  us  find,  following  Rutherford,  the  trajectory  of  an  alpha 
particle  moving  in  the  field  of  an  infinitely  heavy  s  point  nucleus 
having  a  charge  Ze0.  Our  calculations  will  be  carried  out  in  a 


If  the  fimteness  of  the  nuclear  mass  ,\/nut  is  taken  into  account,  the  nucleus  has  a 
certain  recoil  (like  that  of  the  alpha  particle)  as  a  result  of  the  interaction  In  this  case, 
all  the  calculations  must  be  performed  in  the  center-of-mass  system  and,  m  the  results 
obtained  for  the  case  Mnuc  --  «*,  it  is  necessary  to  replace  the  mass  of  the  alpha  particle 
M0  by  the  reduced  mass 

Af0Mnuc 
Mred 

M0  i  Mnut 
(see  Chapter  12,  Section  C  for  a  discussion  of  the  reduced  mass) 
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coordinate  system  whose  origin  coincides  with  the  nucleus  (see 
Fig.  2.2).  Since  the  field  produced  by  the  nucleus  is  centrally 
symmetric,  in  determining  the  trajectory  of  the  alpha  particles  we 
can  use  both  the  law  of  conservation  of  energy 


E  =  const, 
and  the  law  of  conservation  of  angular  momentum 

L==M0(r  x  v)= const, 


(2.7) 


(2.8) 


where    M0   is   the  mass  of  the  alpha  particle,  r  is  its  coordinate, 
and  v  is  its  velocity. 


Fig.    2.2.    Diagram    for   the   derivation    of   Ruther- 
ford's   formula    for   the    cross    section    of   elastic 
scattering  of  alpha  particles  by  nuclei. 


Let  us  introduce  the  polar  coordinates  r  and  ?.  The  velocity  of 
the  particle  is  given  by 

where  v\\=f  and  vt  =  r$  are  the  components  of  velocity  parallel 
and  perpendicular  to  the  radius  vector  r,  respectively,  andr=~ 
and  $  =t>- .  We  then  obtain,  instead  of  Eqs.  (2.7)  and  (2.8), 


Lz  =  M0  (r x  v\  =  Mtf*$  =  const . 


(2.10) 
(2.11) 


In  the  absence  of  interaction,  the  alpha  particle  would  pass  the 
nucleus  at  a  distance  b  (this  distance  b  is  called  the  impact  para- 
meter). Setting  the  initial  velocity  equal  to  v{}  (that  is,  the  velocity 
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r_^_oo   and    f-*rc,   as   follows   from    Fig.    2.2),  then  (2.10)  and 
(2.11)  can  be  reduced  to  the  form 

=  £„.  (2.12) 


(2.13) 

where  the  initial  energy  E0    is  related  to  the  initial  velocity  VQ  by 
the  equation 

^  =  £..  (2.14) 

Introducing  the  new  variable 

K  =  l  (2.15) 

and  noting  that  then,  according  to  (2.13), 

|t|  =  *»=ttf««,  (2.16) 

and 


where  u'  =  -~,  we  transform  Eq.  (2.12)  to 


Differentiating  this  equation  with  respect  to  cp,  we  obtain 


Hence 

n  =  ^cos?  +  Bsin<p  --  jjgg,.  (2.20) 

The  unknown  constant  coefficients  /4  and  5  can  be  determined 
from  the  initial  conditions 


and 
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Setting  <p^=*  and  u  =--0  in  Eq.  (2.20),  we  obtain 


(2.23) 


and,  consequently,  applying  the  condition  (2.22)  to  Eq.   (2.20),  we 
have 

B  =  y.  <2-24) 

Thus,  we  finally  obtain 


(2.25) 


This  equation  gives  the  relationship  between  the  absolute  value 
of  the  radius  vector  r  and  the  polar  angle  ?,  and  thus  describes  the 
trajectory  of  the  alpha  particle  in  the  Coulomb  field  of  the  nucleus. 
It  is  an  equation  for  a  hyperbolic  trajectory  in  polar  coordinates. 

By  definition,  the  scattering  angle  &  is  equal  to  the  angle 
cp  (<p^'?o  =  7U)  at  which  the  length  of  the  radius  vector  r  becomes 


infinite    (that  is,  from  Fig.  2.2,    «  =  -i  = 
we  find 


).  Therefore,  from  (2.25), 


cot         i-z 

C0t  '          ~  > 


bEg 
'/.el 


(2.26) 


It   follows   that   the    scattering   angle   increases   as   b  decreases, 
attaining  180°  for  6  —  0  (see  Fig.  2.3). 


Q>*0)  Ie0 

Fig.    2  3     Dependence    of    the    scattering 
angle  I/  on  the  impact  parameter  h,  where 


Equation  (2.26)  can  be  checked  experimentally  by  photographing 
the  tracks  of  alpha  particles  in  a  Wilson  cloud  chamber.  From  the 
maximum  scattering  angle,  it  is  possible  to  compute  the  minimum 
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value  of  b  ,  which  turns  out  to  be  of  the  order  of  the  nuclear  radius. 
The  actual  form  of  interaction  between  an  alpha  particle  and  a 
nucleus  can,  however,  be  determined  more  accurately  from  an 
investigation  of  the  cross  section  for  the  scattering  of  alpha  particles 
by  nuclei.  For  this  purpose,  Rutherford  calculated  the  relative 
number  of  particles  scattered  at  an  angle  0  or,  to  be  precise,  the 
number  of  scattered  particles  which  we  would  expect  to  find  within 
the  solid  angle 

di3  =  2*sinOd».  (2.27) 

Suppose  N  particles  impinge  per  unit  time  on  a  unit  surface 
placed  perpendicularly  to  the  original  velocity  of  the  particles. 
From  Eq.  (2.26),  it  follows  that  the  scattering  angle  depends  only 
on  the  impact  parameter  b.  For  a  particle  to  be  scattered  through 
an  angle  &,  it  must  strike  a  ring  formed  by  two  circles  with  radii 
b  &ndb  —  db.  The  area  of  this  ring  is  2*bdb.  Therefore,  the  number 
of  particles  which  hit  this  area  and  then,  as  a  result  of  scattering, 
are  found  within  the  solid  angle  d&  is 

dN  =  N.2«\bdb\.  (2.28) 

From  Eq.  (2.26),  we  obtain  for  the  relative  number  of  particles 
scattered  through  an  angle  d 

(2.29) 

The  ratio  dNjN  has  the  dimensions  of  area  and  is  called  the  differ- 
ential  cross  section.  It  is  usually  denoted  by  do.  Taking  the  deriva- 

tive of  cot2^  in  Eq.  (2.29),  we  obtain  the  Rutherford  formula  for 
elastic  scattering  of  alpha  particles  by  a  Coulomb  center: 


This  formula  no  longer  depends  on  parameter  b. 

If  all  the  quantities  in  this  equation  are  kept  constant,  with  the 
exception  of  0,  we  would  expect  the  following  equation  to  hold: 


do  sin4     =  --    d2  =  const.  (2.31) 

/  \  Z£<>  / 

o 

For  0—  *7cf  it  is  found,  however,  that  the  quantity  sin4  y  do  ceases  to 

be    constant    and  begins   to   decrease   somewhat.    This   fact  was 
explained  by  Blackett,  one  of  Rutherford's  students,  who  studied 
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the  limits  of  applicability  of  the  Coulomb  law.  Blackett  took  a  large 
number  of  photographs  of  tracks  of  particles  in  a  Wilson  cloud 
chamber  and  calculated  the  frequencies  with  which  the  various 
scattering  angles  occur.  From  analysis  of  the  experimental  data,  he 
established  that  the  number  of  particles  observed  at  large  scatter- 
ing angles  [according  to  (2.26),  large  angles  correspond  to  small 
values  of  the  parameter  b  ]  is  markedly  smaller  than  the  number 
yielded  by  the  formula  (2.30).  On  this  basis,  Blackett  concluded 
that,  in  air,  for  example,  Coulomb's  law  is  valid  down  to  dis- 
tances of  the  order  of  3-10  u  cm.  At  smaller  distances,  there  is  a 
deviation  from  Coulomb's  law.  Indeed,  at  b<^^lfrl*  cm,  the 
interaction  between  the  alpha  particle  and  the  nucleus  appears  to 
take  the  form  of  a  strong  mutual  attraction.  Further  experimental 
investigations  have  confirmed  the  existence  of  a  characteristic 
attractive  force  at  distances  less  than  10  12  cm.  This  attraction 
drops  off  rapidly  with  increasing  distance  from  the  nucleus. 

The  Rutherford  formula  (2.30)  can  be  used  to  find  the  number 
Z  from  an  experimental  determination  of  dN  and  N.  This  was 
undertaken  by  Chadwick,  another  of  Rutherford's  students.  Chadwick 
showed  that  the  value  of  Z  is  very  close  to  the  atomic  number, 
which  gives  the  element's  position  in  Mendeleyev's  periodic  table. 
The  existence  of  this  phenomenon  was  rigorously  proved  at  a  later 
date. 

Thus  the  experiments  of  Rutherford  and  his  colleagues  definitely 
established  the  planetary  model  of  the  atom.  These  experiments 
proved  that  the  positive  charge  of  the  electron  is  concentrated  in 
a  nucleus  with  dimensions  of  10~13 — 10~12  cm,  and  that,  inside  the 
atom,  Coulomb  forces  keep  the  electrons  moving  in  orbits  with 
a  radius  of  the  order  of  10~8  cm. 

D.  THE  BOHR  THEORY 

First  of  all,  let  us  attempt  to  develop  Rutherford's  planetary 
model  of  the  atom  on  the  basis  of  classical  theory.  We  shall  re- 
strict ourselves  to  the  case  of  a  hydrogen-like  atom,  namely,  an 
atom  with  a  nucleus  of  charge  Ze0,  with  a  single  electron  (of 
charge  e  =  —  e0)  moving  around  it.  Particular  examples  of  hydrogen- 
like  atoms  include  hydrogen  (Z=\),  ionized  helium  (Z  =  2),  and 
so  forth. 

Introducing  the  polar  coordinates  rand  cp(#  =  rcoscp,  y  =  rsir\y)9 
we  obtain  the  following  equations  for  the  kinetic  energy  and  the 
Coulomb  potential  energy: 
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Then  the  Lagrangian  is 

X  =  *L  (,•  +  ry)  +  Z^  9  (2.32) 

where  m0  is  the  mass  of  the  electron.6 

From   this   Lagrangian,  we  obtain  the  following  equations  for 
the  motion  of  the  electron: 


ft 


Here 


and 

pr  =  ^jr  =  mj  (2.34) 

represent  the  generalized  momenta  associated  with  the  <p  and  r 
coordinates,  respectively.  Since  <p  does  not  occur  explicitly  in 
X  (in  other  words,  it  is  a  cyclic  coordinate),  it  follows  that 

-^£-==0.    Therefore,  the  corresponding  generalized  momentum  is  a 
cty 

constant  of  the  motion 

pv  =  mQr^  =  const.  (2.35) 

This  is  the  law  of  conservation  of  angular  momentum,  which  is  well 
known  from  classical  mechanics.  The  second  conservation  law, 
namely,  the  law  of  conservation  of  energy 

E  =  T+V  =  const  (2.36) 

is  obtained  from  the  condition  that  the  time  t  does  not  occur  ex- 
plicitly in  the  Lagrangian. 

We  shall  consider  only  the  simplest  case,  that  of  circular  orbits, 
for  which  f— -0.  Accordingly,  pr  =  mQf  vanishes,  and  from  (2.33)  we 
have 

=  0.  (2.37) 


6In  the  following  chapters,  the  mass  of  the  electron  in  nonrelativistic  theory  shall  be 
denoted  by  m0,  since  m  will  be  used  for  the  magnetic  quantum  number. 
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Hence 

*'-l£"  <2-38> 

Therefore,  we  obtain  for  the  energy  of  the  electron 

E  =  -|-Vy  -^-^--l-^^l  V.  (2.39) 

Let  us  now  express  the  basic  parameters  of  the  atom  in  terms 
of  the  adiabatic  invariants,  which  were  introduced  by  Ehrenfest. 
According  to  Ehrenfest'  s  definition,  in  the  case  of  periodic  motion, 
the  quantities 

It  (2.40) 

(pi  is  a  generalized  momentum,  and  x,  a  generalized  coordinate) 
remain  constant  during  slow  (adiabatic)  changes  of  the  parameters 
of  the  system  (for  instance,  the  charge). 

In  our  case,  there  is  only  one  degree  of  freedom  (x4  =  <p).  The 
conditions  (2.35)  and  (2.40)  yield  the  equation 

P,  =  "&**  =  -&,  (2.41) 

or 

*=-S5'  <2-42) 


From  Eqs.   (2.38)  and  (2.42),  we  can  obtain  expressions  for  r  and 
<?  in  terms  of  the  adiabatic  invariant: 

'=43i35r  (2'43) 

^M,=  «=!'i.  (2.44) 


According  to  Eq.  (2.39),  the  energy  of  the  electron  is  then 

£  =  _-2**-^l.  (2.45) 

It  follows  that  the  frequency  of  mechanical  oscillation  v0  is  given 
by  the  derivative  of  the  energy  with  respect  to  the  adiabatic  in- 
variant: 
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This  relation  holds  not  only  for  the  case  under  consideration,  but 
also  for  any  periodic  or  quasi- periodic  motion.7  Moreover,  a 
system  performing  any  periodic  motion  can  generally  emit  radia- 
tion not  only  at  the  first  harmonic  k=  1,  but  also  at  the  harmonics 
k  =  2,  3, 4, . . .  .  The  following  expression  is,  therefore,  obtained  for 
the  classical  frequency  of  radiation: 

v  =  Av0==A4j~.  (2.46a) 

The  classical  theory  of  the  planetary  model  of  the  atom  led  to 
a  number  of  difficulties  when  it  was  used  to  explain  the  radiation 
of  atoms.  Since  this  model  is  dynamic,  it  follows  from  classical 
electrodynamics  that  the  centripetal  acceleration  of  the  electron 

vz 

(w  =  ~9  where  v  is  its  velocity,  and  a  is  the  radius  of  its  orbit) 
will  cause  the  electron  to  lose  energy  at  the  rate 

dE  __  2  e*w* 
dt  ~~  3     c8 

until  it  falls  into  the  nucleus.  This  is  not,  however,  what  actually 
takes  place,  and  atoms  can  exist  in  anon- radiating  state  for  an  ar- 
bitrarily long  time.  Moreover,  according  to  the  classical  theory, 
the  frequency  of  the  radiation  should  be  the  same  as  the  frequency 
of  mechanical  oscillation  (the  fundamental  frequency  CD  =  u>0  =  2irv0 ) 
or,  alternatively,  an  integral  multiple  of  this  frequency  (one  of  the 
harmonics  «>„  =  m»0 ,  where  n=  2,  3,4,. . .).  Once  again,  this  predic- 
tion does  not  explain  Balmer's  experimentally  established  formula 
(2.1)  for  the  lines  in  the  spectrum  of  radiation. 


We  define  periodic  motion  as  a  motion  in  which  a  point  returns  to  its  initial  position 
after  a  certain  period  of  time.  As  examples  of  this,  we  can  take  harmonic  motion 

x     -    a  cos  fM 
or  motion  about  an  ellipse 

x    —    a  cos  tilt       and       v    —    b  sin  (Ot  , 

a  special  case  of  which  is  motion  about  a  circle  (a  =  6).  Quasi-periodic  motion  is  motion 
in  which  a  point  does  not  return,  as  a  rule,  to  its  initial  position,  but  each  individual 
coordinate  reassumes  its  initial  value  after  a  certain  period  of  time  (which  is  different 
for  each  of  the  coordinates).  As  an  example  of  quasi-periodic  motion,  we  can  take 

x    -    a  cos  co it  , 
y    -    b  cos  o)2*  , 
where  the  frequency  (o\  is  incommensurable  with  a>2- 


34  NONRELATIV1ST1C    QUANTUM     MECHANICS 

The  solution  to  these  difficulties  was  found  in  1913  by  Niels 
Bohr,  who  added  two  postulates  to  the  classical  laws  of  motion. 

First,  Bohr  assumed  that  each  atom  has  a  series  of  discrete 
stationary  states  in  which  the  electron  does  not  emit  radiation,  even 
though  its  motion  is  accelerated  (the  postulate  of  stationary  states). 
According  to  Bohr's  theory,  these  stationary  states  can  be  deter- 
mined by  quantizing  the  adiabatic  invariants: 

,dq{=inh,  (2.47) 

where  the  quantum  number  n  can  assume  only  integral  values,  that 
is,  n  =  1,  2,  3, ...  (in  classical  mechanics,  the  adiabatic  invariant/ 
could  assume  any  constant  value). 

Second,  Bohr  hypothesized  that,  when  an  electron  passes  from 
a  stationary  state  with  energy  En  (the  initial  state)  to  a  state  with 
an  energy  £*'<£„  (the  final  state),  the  atom  radiates  a  quantum  of 
energy  fa^ha*  (the  frequency  postulate),  whose  angular  frequency 
o>  is 

a,  =  *»-"*»'.  (2.48) 


This  equation  can  be  written  in  a  form  similar  to  the  classical 
expression  (2.46a)  for  the  frequency  of  radiation: 

*  =  (rt_n^.  =  *j^.  (2.48a) 

Here  the  quantum  number  k  =  n  —  n'  can  be  interpreted  as  the  cor- 
responding harmonic,  and,  moreover,  the  derivative  of  E  with 
respect  to  /,  which  occurs  in  the  classical  expression,  is  replaced 
by  the  ratio  of  the  finite  increments. 

Bohr's  postulate  that  the  stable  energy  states  of  atoms  form  a 
discrete  spectrum  was  confirmed  in  experiments  conducted  in  1919 
by  Franck  and  Hertz.  Passing  an  electronbeam  (a  current)  through 
mercury  vapor,  they  showed  that,  for  electron  energies  less  then 
4.9  ev,  the  collisions  of  electrons  with  Hg  atoms  do  not  effect 
the  magnitude  of  the  current.  When  the  electron  energy  attains 
4.9  ev  (see  Fig.  2.4),  the  current  suddenly  drops.  With  further  in- 
crease of  the  electron  energy,  periodic  sharp  dips  of  the  current 
are  observed  (approximately  every  4.9  ev).  This  phenomenon  can 
be  very  simply  interpreted  from  the  point  of  view  of  the  Bohr 
theory.  Let  us  suppose  that  the  energy  of  an  unexcited  Hg  atom 
(that  is,  the  atom  before  collision)  is  £0  and,  in  conformity  with  the 
first  Bohr  postulate,  let  us  take  the  next  possible  energy  value  to  be 
£1==£0  +  4.9  ev.  It  then  follows  that  a  beam  of  electrons  of  energy 
4.9  ev  is  not  sufficient  to  raise  the  atoms  to  an  excited  state; 
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therefore,  the  collisions  are  elastic  and  the  current  does  not  change. 
If  however,  £^>  4.9  ev,  the  electrons  in  the  beam  may  give  up  a 
part  of  their  energy  (namely,  4.9  ev)  to  the  atoms,  and  therefore, 
the  current  will  change.  If  the  electrons'  energy  lies  in  the  range 
14.7  ev  >£>  9.8  ev,  the  transfer  of  energy  to  the  atoms  can  occur 
twice;  4.9  ev  is  given  up  in  the  first  collision  and  4.9  ev  in  the 
second. 


0 


/4.7 


4.9  9.8 

Electron  energy,  ev 
Fig.    2.4.   The   dependence  of  current  on 
electron     energy     in     the     Franck-Hertz 


experiment  (1  ev  -  e0/300  ^ 
erg). 
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!  2 


We  shall  now  use  the  first  and  second  Bohr  postulates  [Eqs. 
(2.47)  and  (2.48)]  to  construct  a  theory  of  the  hydrogen-like  atom. 
In  the  equation  for  the  radius  of  the  orbit  (2.43)  and  the  equation  for 
the  energy  (2.45),  let  us  substitute  for  /  its  quantized  value,  which 
from  Eq.  (2.47)  is 


We  thus  have 


(2.49) 
(2.50) 


For  n=\9  we  obtain  the  energy  of  the  lowest  (ground)  state  of  the 
atom 

r //ipZ  gfl  /o  t;  i  \ 

M  —  —      2ft8     '  l^.o-l-J 

and  the  corresponding  radius 


ft8          1 


(2.52) 
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where 

a0  =  -?L-^  0.529-  ID'8  cm  (2.53) 


is  the  radius  of  the  first  Bohr  orbit. 

The  second  Bohr  postulate  (2.48),  combined  with  Eq.  (2.50)  for 
the  frequency  of  radiation  a>rtn»,  gives 


__  Kn  —  En.  __  OTOZ^»  /I  1  \ 

aw  _  —a-—  -  -jar-  ^T  -  -n,  j  - 


Accordingly,  for  2  =  1,  we  obtain  Balmer's  formula  (2.1).  The  Bohr 
theory  also  relates  the  Rydberg  constant,  established  prior  to  this 
on  a  purely  empirical  basis,  with  the  Planck  constant  ft: 

«  =  -*££.  (2-55) 

This  derivation  of  Balmer's  formula,  with  a  value  for  the 
Rydberg  constant  that  agreed  with  experiment,  was  one  of  the 
greatest  achievements  of  the  Bohr  theory.  In  spite  of  this  success, 
however,  the  Bohr  theory  has  a  number  of  inherent  defects,  which 
became  increasingly  important  in  its  further  development. 

In  the  first  place,  the  Bohr  theory  was  obviously  semiclassical 
in  nature.  In  addition,  the  Bohr  theory  could  be  used  to  compute 
only  the  frequency  of  the  spectral  lines,  and  not  their  intensity.  To 
find  the  intensity,  it  was  necessary  to  resort  to  classical  electro- 
dynamics, on  the  basis  of  the  so-called  correspondence  principle.3 
Finally,  the  Bohr  postulates  could  not  be  used  to  construct  a  satis- 
factory theory  of  multielectron  atoms,  including  that  of  helium, 
which  possesses  only  two  electrons. 


According  to  the  correspondence  principle,  in  the  limit,  all  the  results  of  a  previous 
theory  (here,  i  lassical  electrodynamics)  should  follow  from  a  new  theory  (here,  the  Bohr 
theory). 

For  instam  e,  for  //  *  0  (or  at  the  limit  ot  large  quantum  numbers),  tho  results  of  quan- 
tum mechanics  should  appioach  the  classical  results  In  exactly  the  same  way,  when 
ft  >  0,  the  results  of  the  lelativistic  theory  should  approach  the  nonrelativistic  results, 
and  so  on.  Thus,  the  correspondence  principle  enables  us  to  check  the  extension  of  a 
theory  by  requiring  it  to  reduce  to  the  classical  picture  at  the  limit. 

In  the  initial  stages  ot  a  new  theory,  when  it  cannot  yet  be  used  to  investigate  certain 
phenomena,  the  correspondence  principle  may  be  used  for  reasonable  extensions  of  the 
predictions  of  the  old  theory  to  the  new  theory  Thus,  for  example,  the  Bohr  theory  could 
be  used  to  calculate  the  frequency  oi  the  radiation,  but  not  its  intensity  Bohr  was  able 
to  determine  the  intensity  from  the  correspondence  principle,  according  to  which  the  only 
changes  ot  the  quantum  number  n  that  were  allowed  (the  selection  rules  _\«  -  n  -  n')  were 
those  that  coincided  with  the  classically  allowed  harmonics  of  the  radiation.  The  quantum 
theory  of  radiation  of  light  was  completed  only  with  the  development  of  quantum  electro- 
dynamic  s. 
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Accordingly,  we  shall  not  discuss  in  greater  detail  the  subsequent 
history  of  the  Bohr  theory.  This  theory  represented  only  an  inter- 
mediate stage  in  the  development  of  quantum  mechanics,  a  theory 
which  can  be  used  to  determine  both  the  frequency  and  intensity  of 
the  radiation  of  atoms.  However,  we  thought  that  at  least  a  brief 
discussion  of  the  basic  principles  of  the  Bohr  theory  was  advisable, 
for  this  theory  still  retains  considerable  heuristic  significance. 
In  particular,  the  Bohr  theory  often  serves  as  the  starting  point  in 
the  analysis  of  many  results  related  to  quantization. 

The  conclusions  that  follow  from  Eq.  (2.54)  will  be  discussed  in 
Chapter  13,  where  the  problem  of  the  hydrogen-like  atom  will  be 
solved  quantum-mechanically. 


Using  the  Bohr  theory,  quantize  a  hydrogen-like  atom  for  the  case  of 
is,  find  the  spectrum  of  the  energy  levels).  Show  that,  in  the  non- 
relativistic  approximation,  the  coordinates  r  and  y  have  the  same  frequency  of  variation; 
that  is,  the  motion  is  periodic.  Show  that  the  formula  for  the  frequency  of  the  radiation 
of  an  atom  remains  the  same  as  in  the  case  of  circular  orbits* 

What  new  feature  is  introduced  in  the  theory  by  considering  elliptic  orbits? 
Solution,    Using   condition  (2.40)   and  noting   that,  in  this  problem,  the  generalized 
momenta  corresponding  to  the  <f  and  r  coordinates  are 


p  =  //J0r2<j>  =  const  and  pr  =5  //iflr  =  I/  2//io£+  - 
we  find 


rmax 


()  'nun 

There  the  second  integral  was  calculated  from  the  formula 


min 


Hence  the  energy  E  is 


By   direct  differentiation  of  this   equation  with   respect  to  the  variables  lr  and  /f,  it  is 
easily  shown  that  the  frequencies  with  which  the  r  and  y  coordinates  change 

dE  dll 


coincide  with  one  another.  Using  the  Bohr  quantization  rule  (2.47),  we  replace  the  adiabatic 
invariants  /r  and  /f  by  nrh  and  n^h,  respectively,  obtaining 
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where  n~nr~\-riy,  nr  =  0, 1, 2, 3, . . .  and  n^  =  1, 2, 3,. . .  This  expression  is  completely 
identical  with  Eq.  (2.50),  which  was  derived  for  the  case  of  circular  orbits;  therefore, 
the  radiation  frequencies  remain  the  same  as  before. 

States  with  a  given  nr  and  different  n^  have  the  same  energy,  but  differ  in  eccentric- 
ity c: 


In  particular,  when  n^  =  n  ,  we  have  a  circular  trajectory.  As  n^  decreases,  the 
elliptic  trajectory  becomes  more  elongated. 

The  new  feature  to  appear  in  the  case  of  elliptic  orbits  is  the  selection  rules  A/^r^  -h  1, 
A«  =  Ht  1,  ±  2,  ±3,...  .  These  account  for  the  appearance  of  series  that  can  also  be 
found  from  the  correspondence  principle.  In  the  case  of  circular  orbits,  Aw  =  bn^  =  ±.  1 
(see  also  Chapter  13). 


I  Find  the  classical  equation  of  motion  and  the  trajectory  of  an  electron 
according  to  relauvistic  theory.  Show  that,  in  this  case,  the  r  and  <p 
coordinates  change  with  different  frequencies;  that  is,  the  motion  is  quasi-periodic. 
Determine  the  angle  through  which  the  perihelion  of  the  electron  is  shifted  during  "one" 
revolution.  Obtain  a  formula  for  the  energy  levels  and  find  their  splitting.  Compare  the 
results  with  the  corresponding  nonrelativistic  problem  2.1. 
Solution.  Using  the  reiativistic  Lagrangian  function 


(2.57) 

f 

where 


we  obtain  the  equation  of  motion 

d        ni0v 


dt 


r. 


/)  V  1  *^ 

From  Eq.  (2.57)  we  determine  the  generalized  momenta /i(p  =  --.  and^,=— T-. 
Since  /;,.  =  ~J-  r\  where  r(  ^=s  ~- ,  it  follows,  in  accordance  with  the  law  of  conservation 
of  energy,  that 

E==C\/    m*C*  ^P*^  >s  """  ~f"  ~  COIlbt» 
which  implies  that 


Hence  we  obtain  the  equation  of  the  trajectory 

r==   1+icos-re  *  (2'58) 
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where 


q-~Ze\E* 


/,/,         WfiC4\ 
.+4^ 


It  is  apparent  from  Eq.  (2.58)  that  the  motion  is  quasi-periodic.  For  the  shift  Ay  of 
the  perihelion,  we  have  from  (2.58) 


With  the  help  of  (2.40)  and  (2.56)  we  get 


where 


Then,  for  the  energy  /:,  we  obtain  the  expression 

Z*e* 


£=/fI0C»      1  +  -.-- 


4*' 


(2.59) 


From  this  it  is  evident  that  the  frequencies  to^—  ^.    and  ay  =  -.—  will  be  different. 

y/(p  y/r 

Using  the  Bohr  quantization  rule  (2.47),  we  transform  Eq.  (2.59)  to 

B       -  if  -  «^  -  «. 


e*          1 
where  a  ^=  -i  J  G^  .^  is  the  fine-structure  constant. 

Expanding  the  formula  (2.60)  into  a  series  in  a2  and  restricting  ourselves  to  quantities 
of  the  order  of  a*f  we  have 

n       3 


Since  MV  varies  from  1  to  wt  it  follows  from  Eq.  (2.61)  that  the  energy  levels,  which 
determined  by  the  principal  quantum  number  /i  =  /ir-j-  /i9t  are  split  into  n  closely 
ed  sublevels  (this  close  spacing  is  a  consequence  of  the  small  ness  of  or). 


are 
spaced 
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The  fine-structure  splitting  is 

US        =F        -E  -      ***** 

^n-  \      l-«.  \      ***.  "f  -  l  ~  n\  (n9  '-{)• 

The  splitting,  or  fine  structure,  of  the  levels,  is  a  characteristic  result  of  relativistic 
effects  and  essentially  distinguishes  the  predictions  of  relativistic  theory  from  those  of 
nonrelativistic  theory  (see  Problem  2.  IX 


roblem  2.3.  JAn  electron  is  located  in  a  central  field 


Determine  the  values  of  s  at  which  stable  states  of  the  system  are  possible. 
Answer,  s  <C  2. 
Hint.  Use  the  expression  for  the  effective  potential  energy 

2/rtS*  ~~  r* 
and  make  use  of  the  fact  that  stable  motion  is  possible  if  l/eff  has  a  minimum. 


fcoblem  2.4    Find  the  scattering  cross  section  for  nonrelativistic  electrons  by  nuclei 
(Coulomo  point  charges).  Compare  the  result  with  the  scattering  cross  section  for  alpha 
particles. 
Answer. 


2 


A 

_roblem  2.5J  Show  that  in  the  relativistic  scattering,  unlike  the  nonrelativistic  case, 
electroWCftfl  be  captured  by  nuclei.  Obtain  the  total  capture  cross  section. 

Solution.  According  to  nonrelativistic  theory,  the  trajectory  of  electrons  in  the 
Coulomb  field  of  a  nucleus  is  given  by  the  equation 


1  -f-  £  COS  cp  » 

where 


At  /•  >  Of   this  equation  describes  hyperbolic  motion,  and  therefore  the  capture  of  elec- 
trons is  impossible. 

In  relativistic  mechanics,  the  equation  of  the  trajectory  has  the  form  of  Eq.  (2.58). 
%*e\ 
If  "c~y  >  1  in  this  equation,  7   becomes  imaginary.  Therefore,  r  —  Ofor  <p  -*  co;  that  is, 

the  particle  falls  to  the  center,  which  means  it  is  captured. 

Since  the  angular  momentum  p^  of  the  electron  is  related  to  the  initial  momentum  p 
and  the  impact  parameter  b  by  the  expression  />9=/^,  the  condition  for  capture  can  be 

written  in  the  form  b~  ^:  -^  '-" .   Hence  the  total  capture  cross  section  is 


°cap 


Chapter  3 

Wave  Properties  of  Particles 

A.    DE  BROGLIE  WAVES 


As  we  mentioned  in  Chapter  1,  the  development  of  modern 
quantum  theory  began  with  the  discovery  that  light  has  particle 
properties  in  addition  to  wave  properties  (characterized  by  the 
wavelength  X  and  the  frequency  o>).  The  energy  e  and  momentum  JT 
of  a  quantum  of  light  (photon)  were  established  by  Einstein  as 

e  =  ft«,  =  Av,     rt«=ftft=J«D.  (3.1) 

Analyzing  these  equations,  the  French  physicist  de  Broglie  sug- 
gested that  they  could  be  generalized  to  apply  to  ordinary  particles 
and,  in  particular,  electrons.  Generally  speaking,  de  Broglie  as- 
sumed that  the  wave-particle  duality  is  not  an  exclusive  property  of 
light,  but  is  also  a  characteristic  of  electrons  and  all  other  parti- 
cles.1 Accordingly,  a  beam  of  free  electrons,  whose  relative  energy 
E  and  momentum  p  are  related  to  the  velocity  v  by  the  equation2 

E  =  -^£=-,    p  =  -W-9  (3.2) 

/l-p«       ^       /I-?' 

should  also  exhibit  wave-like  properties.  The  corresponding  fre- 
quency and  wave  number  were  defined  by  equations  similar  to 
Einstein's: 

E  =  Aw  and />  =  **.  (3.3) 


This  hypothesis  was  made  by  de  Broglie  with  a  twofold  purpose:  first,  to  provide  a 
physical  basis  for  the  Bohr  quantization;  second,  to  explain  the  first  experiments  on  elec- 
tron diffraction  (see  below). 

From  now  on,  we  shall  leave  it  to  the  reader  to  distinguish  between  the  relativistic 
energy  (which  includes  the  rest  mass  energy)  and  the  nonrelativistic  energy.  Only  in 
cases  in  which  the  two  energies  appear  in  the  same  equation  will  they  be  distinguished 
by  some  kind  of  index,  for  example, 

\  m0c2 
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Consequently,  the  de  Broglie  wavelength  of  the  moving  particles  is 

X=^-  =  A  (3.4) 

k       P- 

The  de  Broglie  relations  (3.3)  thus  generalized  Einstein  equations 
(3.1),  derived  from  the  photon  theory.  These  now  became  equally 
applicable  to  the  analysis  of  light  in  terms  of  its  particles  and 
of  moving  electrons  in  terms  of  their  wave  properties.  It  is  worth 
noting  that  the  dual  character  of  particles  and  light  disappears 
if  Planck's  constant -ft  is  allowed  to  go  to  zero  (the  correspondence 
principle). 

Taking  de  Broglie's  equations  (3.3)  as  the  basis  of  discussion, 
we  may  describe  the  motion  of  free  particles  (along,  say,  the  x 
axis)  by  the  so-called  wave  function,  which  for  this  particular  case 
is  analogous  to  that  of  light  and  represents  a  plane  wave: 

^(x,t)  =  Ae'2rt^'^  =  Ae"^-^  =  Ae'^(l!t''p^.  (3.5) 

From  the  standpoint  of  Eq.  (3.5),  we  can  attempt  to  explain  Bohr's 
postulate  of  stationary  states  [see  Eq.  (2.47)].  The  physical  inter- 
pretation which  we  can  offer  is  as  follows:  the  only  allowed  circular 
orbits  are  those  which  are  divisible  by  an  integral  number  of 
de  Broglie  wavelengths,  that  is, 

2T  =  ».  (3.6) 

Indeed,  the  wave  function  is  single- valued  only  when  this  condition 
is  satisfied.  Furthermore,  since  in  the  nonrelativistic  case 

\=  — 

mQv  ' 

Eq.  (3.6)  yields  the  Bohr  condition  for  stationary  states: 

/in.  (3.7) 


B.    EXPERIMENTAL  OBSERVATION  OF  THE 
WAVE  PROPERTIES  OF  PARTICLES 

To  investigate  the  wave  properties  of  electrons  and  to  prove 
that  electrons  have  a  specific  wavelength  X,  it  is  first  of  all  neces- 
sary to  obtain  a  monochromatic  electron  beam  (a  beam  of  electrons 
moving  with  the  same  velocity).  One  possible  way  of  producing 
such  a  beam  is  by  means  of  an  "  electron  gun"  (see  Fig.  3.1), 
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which   emits    electrons    with  a  certain  definite  velocity  v.    In  the 
nonrelativistic  case  (u<c),  this  velocity  is  given  by  the  equation 


300' 


(3.8) 


where  <D  is  the  accelerating  potential  (in  volts)  of  grid  A  relative 
to  the  cathode  C.  Thus,  after  passage  through  grid  A  (at  which 
point  no  further  acceleration  is  imparted),  the  electrons  will  have 
a  de  Broglie  wavelength  of 


X  =  —  =  -          -  =  ---  — 
~~        ~  ~ 


(3.9) 


To  exhibit  the  wave  properties  in  the  most  pronounced  form, 
one  must  impart  to  them  the  longest  possible  wavelength  ^. 
This  can  be  done  by  decreasing  <1>.  Since,  however,  a  certain 
amount  of  energy,  known  as  the  work  function,  is  expended  in  the 
ejection  of  electrons  from  metal  (this  energy  is  of  the  order  of 
several  ev  and  gives  rise  to  a  certain  spread  in  the  electron  veloc- 
ities), the  smallest  potential  $  at  which  the  beam  will  be  relatively 
monochromatic  is  15-20  volts.  At  these  conditions,  the  de  Broglie 
wavelength  of  the  electrons  is  approximately  the  same  (\~  10~8  cm) 
as  the  wavelength  of  soft  x-rays. 

Before  we  consider  the  physical  nature  of  the  wave  -like  charac- 
ter of  an  electron  beam,  let  us  discuss  several  experiments  that 
have  led  to  the  direct  detection 
of  de  Broglie  waves.  To  begin 
with,  we  shall  take  the  elec- 
tron diffraction  experiments  of 
Davisson  and  Germer ,  who  were 
the  first  to  make  experimental 
observations  of  electron  waves. 
Since  the  de  Broglie  wavelength 
of  an  electron  is  of  the  order 
of  10  8  cm,  they  proceeded  in 
the  same  way  as  for  soft  x-rays, 
using  a  crystal  with  a  lattice 
constant  of  the  order  of  10  8  cm  for  the  diffraction  grating.  The 
set-up  of  the  Davisson-Germer  experiments  is  depicted  in  Fig. 
3.2.  After  coming  out  of  the  electron  gun,  the  beam  impinges 
perpendicularly  on  the  surface  of  the  crystal,  where  the  electrons 
are  scattered  at  various  angles  by  the  surface  lattice.  Since  the 
penetration  of  the  electrons  into  the  crystal  can  be  neglected,  the 
diffraction  grating  can  be  regarded  as  two-dimensional.  Accord- 
ingly, the  position  of  the  diffraction  maxima  can  be  determined 
from  the  condition  that  the  path  difference  s  =  dsin&  (d  is  the 


Fig.  3. 1    An  electron  gun  that  produces  a 

monochromatic  beam  of  electrons. 
C  is  the  cathode,  and  A  is  the  anode. 
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two-dimensional  lattice  constant  and   t>  is  the  scattering  angle)  is 
equal  to  an  integral  number  of  wavelengths  (see  Fig.  3.3): 

d  sintt,,  —  //A  =  n  — ---— ,  (3.10) 


where  the  integer  n  is  the  order  of  the  given  diffraction  maximum. 
The  diffraction  maxima  were  detected  with  a  galvanometer,  which 
registered  the  intensity  of  the  beam  of  scattered  electrons  at 
different  angles  (see  Fig.  3.2). 
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Fig.    3  2     Diagram    of   the    Davisson-Genw»r 
experiments  on  electron  diffraction 

A  second  prediction  of  the  de  Broglie  theory  was  also  verified: 
namely,  the  prediction  that  as  the  potential  (l>  increases,  the  angle 
ft,,,  cor  responding  to  the  nth  diffraction  maximum,  will  decrease  in 
accordance  with  Eq.  (3.10).  Thus  the  correctness  of  de  Broglie' s 

equations  was  completely  confirmed  by 
these  investigations. 

It  is  well  known  that  x-ray  diffrac- 
tion patterns  are  produced  not  only 
by  single  crystals,  but  also  by  poly- 
crystalline  formations.  This  was  shown, 
for  example,  by  Debye  and  Scherrer. 
Tartakovskiy  and  Thomson  obtained  dif- 
fraction patterns  similiar  to  these  x-ray 
diffraction  patterns  by  extending  this 
technique  to  electron  waves  and  passing 
an  electron  beam  through  a  foil.-1  The 
following  theoretical  explanation  can  be 
given  for  the  diffraction  pattern  of  electrons.  The  electron  beam, 
with  an  energy  of  several  thousands  or  even  tens  of  thousands  of 
electron  volts,  impinges  upon  a  polycrystalline  foil,  where  it 


Tip.   \\.\\    Electron  scattering 
by  a  two-dimens tonal  diffrac- 
tion grating 


*  To  reduce  the  absorption  of  electrons,  a  relatively  thin  foil  (thickness  of  the  order  of 
10       cm)  was  used 
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encounters  single  crystals  and  is  reflected  from  them.  The  path 
difference  s  of  two  rays  (see  Fig.  3.4)  is  related  to  the  lattice 
constant  dot  the  three-dimensional  crystal  by  the  relation 


(3.11) 


where    <p  is  the  angle  between  the  ray  and  the  lattice  plane.  Since 
the    single  crystals  of  the  foil  are  oriented  at  random,  a  ray  can 
leave    the    foil   at   any   angle   with 
respect    to    the   original   direction 
(see  Fig.  3.5).  Among  the  crystals, 
there  will  be  some  that  are  oriented 
at  just   the    right   angle   to   satisfy 
Bragg's  law 


2d  sin  <p  =  n\  =  n  ~- ,  (3.12) 


g.    3  I    Reflection  from  a   three- 
dimensional  crystal  lattice 


where  n     is    an   integer.    Whenever 
this  is  the  case,  a  diffraction  maxi- 
mum occurs  and  a  bright  spot  Q  is  found  on  the  screen.  Since  the 
experimental  apparatus  is  cylindrically  symmetric,  the  bright  spots 
form  diffraction  rings,  whose  radiuses  Rn  can  be  found  from  the 
relation  (see  Fig.  3.5) 

tan  2ff== — ,  (3.13) 

where  L  is  the  distance  from  the  screen  to  the  polycrystalline  foil. 
Since  the  angle  <p   is  very  small  in  these  experiments  (<p^sin< 
tan  ?),  Eqs.  (3.12)  and  (3.13)  yield 


L    12-  10-' 


(3.14) 


that  is,  at  constant  L,  d  and  n, 


(3.15) 


These  relations  were  completely  confirmed  by  Thomson's  experi- 
ments. Electron  diffraction  patterns  and  Debye-Scherrer  x-ray 
diffraction  patterns  are  now  widely  used  in  studies  of  crystal 
structure. 

It  is  worth  noting  that  de  Broglie's  formula  does  not  only  apply 
to  electrons  and  other  elementary  particles  such  as  protons  and 
neutrons,  but  also  to  complex  nuclei,  multielectron  atoms,  and 
even  to  molecules.  True,  their  de  Broglie  wavelength  is  very  small 
because  of  their  relatively  large  mass.  Nevertheless,  Stern  and 
Esterman  have  succeeded  in  observing  the  diffraction  of  beams  of 
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helium   atoms   and  hydrogen   molecules    in  reflections  from  LiF 
crystals. 


Polycrystalline 
foil 


Pencil  of 
electrons 


Screen 


Fig.    3.5.    Diffraction   of   electron   waves    in   a 

polycrystalline    substance    (the    Tartakovskiy- 

Thomson  experiments) 


A  method  involving  neutron  diffraction  has  been  found  extremely 
useful  in  analyzing  the  structure  of  substances.  Neutrons  have  no 
electric  charge  and,  therefore,  pass  freely  through  matter  even  at 
low  energies  (thermal  neutrons),  when  their  de  Broglie  wavelength 
is  relatively  large.  All  these  facts  provide  convincing  evidence 
that  wave  properties  are  displayed  in  some  degree  or  other  by  all 
particles  and  that  the  de  Broglie  formula  for  X  is  of  universal 
validity. 

The  analogy  between  light  waves  and  electron  waves  has  led  to 
the  development  of  a  new  branch  of  physics,  electron  optics,  which 
is  devoted  to  the  study  of  wavelike  processes  in  electron  beams. 
With  the  help  of  this  new  science,  it  has  been  possible  to  design 
and  construct  electron  microscopes,  which  have  found  wide  appli- 
cation in  modern  techniques.  In  ordinary  optical  microscopes,  the 
upper  limit  of  the  resolving  power  (and,  consequently,  the  magnifi- 
cation) is  of  the  order  of  the  wavelength  of  the  light  which  is 
utilized.  To  achieve  the  highest  possible  magnification,  it  is  neces- 
sary to  reduce  the  wave  length  of  the  light.  The  wavelength,  however, 
cannot  be  made  arbitrarily  small:  it  is  impossible,  for  instance,  to 
construct  an  x-ray  microscope,  since  no  appropriate  lens  exists 
for  x-rays.  On  the  other  hand,  satisfactory  electric  and  magnetic 
lenses  can  be  produced  for  electron  waves.  Thus,  the  electric 
lens  consists  of  a  capacitor  which  has  an  aperture  in  the  middle 
of  the  plate,  while  the  magnetic  lens  consists  of  ordinary  magnetic 
coils.4 


Modern  optical  microscopes  give  a  magnification  of  approximately  one  or  two 
thousand.  An  electron  microscope  can  give  a  magnification  of  more  than  100,000.  Besides 
the  electron  microscope,  the  proton  microscope  is  also  widely  used  at  present.  Its  re- 
solving power  can  be  made  even  greater  than  that  of  the  electron  microscope. 
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C.    WAVE  PACKETS.     GROUP  AND  PHASE  VELOCITIES 

From  de  Broglie's  hypothesis  it  follows  that  the  motion  of  a 
free  material  particle  having  an  energy  E  =  mc*  and  a  momentum 
p  =  mvt  can  be  described  by  means  of  the  plane  wave  (3.5).  The 
velocity  u  of.  the  de  Broglie  wave  can  be  found  as  the  time  rate  of 
displacement  of  a  constant  phase 

Et~px  =  const;  (3.16) 

that  is,  the  phase  velocity  is5 

„_**_  E  __c2  /Q  17\ 

M=-JT= =  .  I  U.I  f   I 

dt        p        v  x          ' 

According  to  the  theory  of  relativity,  the  velocity  of  the  particle  v 
cannot  exceed  that  of  light  in  vacuum  (c).  However,  the  calculated 
phase  velocity  of  the  wave  appears  larger  than  c.  This  would  indi- 
cate that  it  is  theoretically  impossible  for  a  monochromatic  wave 
to  transport  a  particle  or  carry  energy,  since  all  particles  and 
energy  must  travel  with  a  velocity  smaller  than  that  of  light,  in 
accordance  with  the  principle  of  relativity. 

This  equation  can  be  obtained  from  the  following  simple  considerations.     The  wave 

function  depends  only  on  the  phase  0  —  (Et  —  px)   -,   therefore,   at  the  time  ^  =  t  +  Af, 

h 
this  phase  moves  to  the  point  x^  -  t  f  Ax,  which  can  be  found  from  the  equation 

Et1  -  pxl    —    Et  —  px    =    constant. 
Thus 

EA  t  -  p  A  TC   =   0 

Hence  the  rate  of  propagation  of  the  constant  phase  (which  is  also  the  velocity  of  the 
wave  as  a  whole,  as  can  be  seen  from  Fig.  3.6)  is 

Ax         E 

u    -    — —  -    —  . 

A*       p 


Position  of  the 
&*         wave  at  time  t 


the  wave  at 
time  tf-t+At 

Fig.  3.6.  The  propagation  of  a  monochromatic  wave. 
During  the  time  A(,  the  wave  as  a  whole  moves  a  dis- 
tance Ax.  The  phase  velocity  of  the  wave  is  

At 
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An  escape  from  this  dilemma  was  found  in  the  early  stages  of 
the  development  of  quantum  mechanics.  This  solution  retained  the 
wave  properties  of  particles,  which  had  received  such  striking 
experimental  confirmation.  Thus,  each  particle  was  associated 
with  a  group  of  waves  of  nearly  equal  frequencies,  rather  than  with 
a  single  monochromatic  wave.  This  was  further  suggested  by  the 
fact  that  the  diffraction  lines  of  electron  waves  were  always  ob- 
served to  have  a  certain  definite  width.  Thus,  it  seemed  that  several 
waves  with  very  nearly  equal  frequencies  were  diffracted,  rather 
than  an  individual  wave.  Yet  another  basis  for  this  solution  was 
provided  by  spectroscopic  studies,  which  showed  that  all  spectral 
lines  are  also  characterized  by  a  certain  definite  width. 

One  advantage  which  results  from  using  a  set  of  waves  with 
nearly  equal  frequencies  rather  than  an  individual  monochromatic 
wave  is  that  it  is  always  possible  to  construct  a  wave  packet  whose 
resultant  amplitude  is  appreciably  different  from  zero  only  in  a 
certain  small  region  of  space.  This  small  region  of  space  can  be 
associated  with  the  position  of  the  particle. 

On  the  basis  of  these  considerations,  let  us  attempt  to  describe 
the  motion  of  a  particle  by  constructing  a  wave  packet  out  of  a 
continuous  set  of  waves,  assuming  that  the  momentum  //,  which  is 
connected  with  the  wave  number  U  by  the  de  Broglie  relationship 

k'  =  p'/ft9  ranges  from  /;  —  \     to  p  Jr  ~  (Ap  ^p).  We  shall  take  the 
amplitudes  A  (//)  of  the  individual  waves  to  be 


.  0.18) 

0, 


The  resultant  wave  function  ty  is 


(3-19) 


In  this  equation,  let  us  change  from  the  variable  p'to  the  variable 
;/„-;/  —  ;)'  ~fj-'^p"  ^^  ,  dp'  =  dp'\  and  expand  £'  in  a  series 
about  the  central  point  p'  —  p: 


(3.20) 
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Restricting  ourselves  for  present  purposes  to  the  first-order  terms 
(terms  proportional  to  p"  ~  A/;)  and  integrating  Eq.  (3.19)  with 
respect  to  dp",  we  obtain 


where  the  amplitude  of  the  wave  packet  is 


dE 

(3.22) 

From   this    equation,    it   follows    that   B  does  not  remain  constant 
either  in  space  or  in  time. 

To  determine  the  velocity  of  the  wave  packet  as  a  whole,  that  is, 
the  group  velocity  //"  corresponding  to  the  motion  of  some  specific 
amplitude,  we  shall  use  the  same  procedure  as  for  the  phase 
velocity.  Let  us  take  a  certain  constant  value  of  the  amplitude  such 
that 


(3.23) 

We  then  obtain  the  following  equation  for  u: 

-  =  <*x  =  M  (3.24) 

dt       dp  * 

Since  E  =  c  Y  P*  +  mfc1  fc>r  a  free  particle,  and 

• 

dE       c-p 
ft  =*-£-•  (3-24a) 

we  find 

(3.24b) 


which  shows  that  the  group  velocity  it  of  the  wave  packet  as  a  whole 
is  exactly  equal  to  the  velocity  v  of  the  particle  itself. 

-  From  the  expression  for  the  amplitude  B(^)  =  A-~-  of  the  wave 


packet  at  time  /  =  0  (when  the  amplitude  corresponds  to  £  =  ,-—  *;  see 

Fig.  3.7),  it  is  readily  seen  that  the  maximum  value  of  this 
amplitude  B(Q)  =  A  lies  at  the  point  *  —  0.  At  all  other  points  cor- 
responding to  the  relative  maxima  the  amplitude  is  smaller.  In 
particular,  considering  different  values  of  the  argument  £,  we  have 
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At  the  point  S  =  TC,  the  amplitude  vanishes  (£(rc)  =  0).  Consequently, 
at   t  =  0  the  wave  packet  may  be  regarded  as  localized  in  the  region 

of  the  first  maximum,  that  is,  in  the  region  A?  =^  A*~it.    Hence, 
we  obtain  the  relation 

^h,  (3.25) 


which  is  known  as  the  uncertainty  principle. 


Group  velocity 


-1 


Fig.  3.7.  Form  of  the  wave  packet  at  t  =  0. 

Since  the  center  of  mass  of  the  Wave  packet  (that  is,  its  principal 
maximum)  travels  with  the  velocity  of  the  particle  (u  =  v),  the  wave 
packet  describes  the  localization  of  the  particle.  In  particular, 
it  follows  from  other  hypotheses  that  have  been  made  that  the 
position  of  the  particle  is  characterized  by  the  square  modulus  of 
the  amplitude  of  the  x  wave,  namely, 


Consequently,  the  quantities  A*  and  Apmay  be  regarded  as  a  meas- 
ure of  the  accuracy  with  which  it  is  possible  to  compute  the 
momentum  and  position  of  a  particle  in  space  by  means  of  the 
wave  theory. 

It  was  also  necessary  to  determine  whether  <]>  waves  could  be 
identified  with  the  structure  of  particles,  or  whether  these  waves 
characterize  only  their  motion.  The  first  interpretation  of  the 
relationship  between  a  particle  and  its  associated  wave  was  proposed 
by  Schrodinger.  In  terms  of  his  hypothesis,  a  particle  is  a  wave- 
like  formation  and  the  density  of  its  "smearing  out"  over  space  is 
given  by  <!>*<)>. 
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In  theory,  a  group  of  waves  can  always  be  used  to  form  a  wave 
packet  whose  size  is  of  the  order  of  the  radius  of  any  given  particle 
(for  example,  an  electron).  This  representation  of  the  particle  is, 
however,  unstable.  Indeed,  as  follows  from  Eq.  (3.17),  the  phase 
velocity  of  each  of  the  monochromatic  waves  contained  in  the  wave 
packet  depends  on  the  corresponding  wave  number,  or  momentum 
p.  Accordingly,  each  of  these  monochromatic  waves  propagates 
with  its  own  phase 6 


As  a  result,  the  wave  packet  gradually  spreads  out  with  time.  The 
"spreading"  time  is  determined  by  the  time  interval  in  which  the 
initially  disregarded  part  of  the  phase  of  <|>  in  (3.21)  becomes  com- 
mensurate with  *.  According  to  (3.20),  the  neglected  part  of  the 
phase  is  proportional  top'*  ~(Ap)*.  Thus,  from  (3.  20)  we  can  obtain 
a  measure  of  the  time  A/  which  elapses  between  the  initial  forma- 
tion of  the  wave  packet  and  its  distortion: 


Or,  the  time  interval  starting  with  the  formation  of  the  wave  packet 
and  ending  when  the  distortion  of  the  latter  can  no  longer  be 
neglected  is 

A*  --  **  {3-26) 


Let   us    replace    Ap    by    its    value   from    (3.25).  Then,  in  the  non- 
relativistic  case  (p<m0c),  Eq.  (3.24a)  yields 


= 

dp        m0  ' 

££  —  jL 
dp*        m0  * 

Thus  we  obtain  for  the  "spreading"  time  of  the  wave  packet 

(3.27) 


6It  can  be  seen  from  this  that  a  stable  wave  packet  can  be  formed  only  for  a  particle 
which  has  zero  rest  mass  (m0  =  0)  and  which  is  moving  in  a  vacuum  (as  an  example  of 
such  a  particle,  we  may  take  the  photon).  It  is  only  in  this  case  that  the  phase  velocities 
of  all  the  components  of  the  wave  package  are  the  same  regardless  of  the  wave  number. 
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For  a  particle  with  w,,^lg  and  A*~o.l  cm,  the  "spreading" 
time  is  A/~1028  sec,  and  thus  the  wave  packet  does  not  actually 
spread.  In  the  case  of  electrons,  however,  m0^lCT27  g  and 
Ax~io~1J  cm.  Therefore,  the  wave  packet  of  an  electron  begins 
to  spread  out  after  kt ^10  *8  sec,  that  is,  almost  instantaneously. 
Thus,  the  electron  is  not  a  stable  formation  in  the  Schrbdinger 
theory  of  the  "smeared"  electron.  This  is  in  obvious  contradiction 
with  the  experimental  facts.  Moreover,  it  becomes  impossible  to 
explain  the  phenomenon  of  diffraction  if  a  monochromatiq  wave, 
which  provides  an  appropriate  description  of  the  motion  of  several 
electrons,  is  replaced  by  a  set  of  several  wave  packets. 

At  present,  another  approach  has  been  adopted,  namely,  Max 
Horn's  statistical  interpretation  of  the  wave  function  (the  quantity 
| ';>  |2™ '!>*<);)  as  the  probability  density,  or  probability  of  finding  the 
electron  at  various  points  in  space.  The  statistical  interpretation 
is  not  concerned  with  the  structure  of  the  electron,  and  the  electron 
can  remain  a  point  charge  (or,  to  be  more  precise,  a  charge  whose 
radius  rQ  does  not  exceed  10  13  cm).  Only  the  probability  of  finding 
the  electron  at  different  points  in  space  changes  as  the  wave  func- 
tion changes  with  time,  but  the  structure  of  the  electron  remains 
completely  unaffected. 

From  the  standpoint  of  Horn's  statistical  interpretation,  the 
quantum-mechanical  treatment  of  problems  involving  many  elec- 
trons in  identical  states  presents  certain  methodological  similarities 
with  the  treatment  of  various  problems  in  the  kinetic  theory  of 
gases  on  the  basis  of  Maxwell's  distribution  function  (it  should  be 
noted,  however,  that  the  quantum-mechanical  distribution  function 
/  =  '],*<<) is  devoid  of  the  temperature  term).  For  example,  in  Horn's 
interpretation,  the  diffraction  pattern  of  an  electron  beam  may  be 
explained  in  the  following  manner.  The  bright  spots  correspond  to 
the  maxima  of  the  function  f=  ty  2,  and  thus  the  greatest  numbers 
of  electrons  travel  towards  bright  spots.  On  the  contrary,  the 
probability  is  smallest  for  electron  motion  in  the  direction  of  dark 
regions. 

One  attempt  to  explain  a  certain  "freedom  of  behavior"  of  an 
individual  electron  is  based  upon  the  complementarity  principle, 
a  solution  adopted  by  Bohr,  Heisenberg,  and  others  (see  also 
Chapter  7).  The  complementarity  principle  asserts  the  theoret- 
ical impossibility  of  extending  our  knowledge  of  the  microscopic 
world  beyond  a  certain  finite,  even  though  small,  limit  of  accuracy, 
since  our  measuring  apparatus  necessarily  must  exert  certain 
indeterminable  effects  on  the  experimental  object  (an  electron, 
for  example).  In  particular,  for  canonically  conjugate  quantities, 
such  as  the  position  and  momentum,  the  degree  of  accuracy 
with  which  they  can  be  measured  is  given  by  the  uncertainty 
principle. 

Heisenberg  attempted  to  provide  a  more  rigorous  foundation  for 
the  complementarity  principle  by  means  of  the  following  hypothetical 
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experiment.  Suppose  we  wish  to  determine  the  position  of  an  elec- 
tron with  the  aid  of  an  "ultramicroscope,"  that  is,  an  instrument 
that  is  designed  for  precisely  this  task  and  that  utilizes  a  light 
beam  of  appropriate  wavelength  X  (it  is  not  actually  possible  to 
construct  such  a  microscope).  We  shall  assume  that  the  electron 
is  located  at  the  vertex  of  a  cone  of  revolution  with  angle  2?,  (the 
objective  being  the  base),  where  <f>c=.the  angle  between  the  incident 
(wavelength  X)  and  scattered  light  beams.  Any  light  that  enters 
the  objective  after  being  scattered  by  the  electron  must  have 
traveled  within  this  cone  of  directions.  From  the  laws  of  optics, 
it  follows  that  the  uncertainty  in  the  determination  of  any  of 
the  electron  coordinates  in  the  plane  parallel  to  the  plane  of  the 
objective  is 


(3.28) 


Moreover,    since   the    light   has    a   momentum  P  =  ~i*  part  of  this 

momentum  is  imparted  to  the  electron  (the  Compton  effect).  Ac- 
cordingly, the  momentum  of  the  electron  in  the  given  direction  can 
only  be  determined  with  an  uncertainty  of 

(3.29) 


The    product   of  these   two   uncertainties   yields   the   uncertainty 
relation. 

From  (3.28),  it  is  seen  that  to  determine  the  position  of  the 
electron  with  the  maximum  degree  of  accuracy,  it  is  necessary  to 
use  light  of  the  smallest  possible  wavelength  X.  Equation  (3.29) 
shows,  however,  that  the  smaller  the  wavelength,  the  greater  is  the 
momentum  imparted  to  the  electron  (the  position  and  momentum 
are  canonically  conjugate  quantities).  Thus,  according  to  the  com- 
plementarity principle,  there  would  have  to  be  two  classes  of 
experimental  aparatus.  One  of  these  would  be  designed  to  measure 
the  spatial  coordinates  with  any  desired  degree  of  accuracy  (in  the 
case  of  "ultramicroscopes,"  instruments  with  X->0),  the  other,  to 
measure  the  corresponding  momenta  (instruments  with  X-*^). 
Then  the  observer  using  an  instrument  of  the  first  class  imparts 
all  of  the  indeterminable  effects  to  the  momentum,  whereas  the 
instrument  of  the  second  class  imparts  all  of  the  indeterminable 
effects  to  the  position.  Accordingly,  in  the  opinion  of  the  adherents 
of  the  Copenhagen  school,  we  cannot  at  any  given  time  ascertain 
both  the  position  and  the  momentum  of  the  object,  although  each  of 
these  variables  can  be  measured  separately  with  any  desired  degree 
of  accuracy. 
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L    Problem  3.1  J  Show  that  there  is  an  integral  number  of  de  Broglie  wavelengths  in  an 
eSfpSFetecB&lrorblt. 

Solution,   According  to  Bohr,  the  set  of  stationary  elliptic  orbits  may  be  found  from 
the  two  conditions 


0)  pr  dr  =  nrh    and    (p 


where 


Since  27*=^  to,  that  is, 

T 

2  f  r  ^  =  ^ 

<f  ft 


it 

2 


o 
where 

n=>n?  +  /iri    and    2Tdt  = 
we  obtain  the  required  result 

O)  y  =  n. 

Problem  3.2/  Show  that  the  wave 


i  3.2^  Show 


L 
2  ' 

is  not  monochromatic  for  finite  values  of  /.  Find  the  range  of  wave  numbers  A&  —  k  —  kQ 
over  which  the  amplitudes  of  the  individual  harmonics  may  be  regarded  as  nonzero. 
Solution.  Let  us  represent  <p  (x)  in  the  form  of  a  Fourier  integral 


00 

<p  (x)  =  f  A  (k)  eittx  dkt 


where  the  amplitude  s  A  (k)  are  given  by  the  formula 

4-00 
4(«  =  i  J  i(x)r**dx. 

-  00 

Substituting  the  expression  for  <p  (x),  we  find 

sin     (k  —  k0)t 


Hence  it  follows  that  the  largest  amplitude  corresponding  to  k  =  k0  is  A  (*0)  =  //2«. 
Although  all  the  other  amplitudes  are  not  equal  to  zero,  in  practice  only  those  amplitudes 
which  are  of  the  same  order  as  A  (k9)  need  be  regarded  as  nonzero.  These  amplitudes 

lie  within  the  range  A*  =  |  k  —  £0  1  ~  -y  .  The  quantity  Aft  «s  2«//  may  be  regarded  as  the 
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"spread"  of  the  wave  vector  due  to  the  finite  width  of  the  wave  packet.  Setting  bk  =  -?- 

71 
and  /  ~  Ax,  we  again  obtain  the  uncertainty  relation: 

A/?  AJC  ^  h. 
Problem  3. 3. \Find  the  mean  velocity  and  the  "spreading*'  time  of  the  wave  packet 

00 

<p  (xt  0  =    \  A  (k)  <?-  '  M  -  *v)  dk  (3.30) 

— oo 

if  the  amplitude  A  (k)  is  in  the  form  of  a  Gaussian  curve: 


Solution.    Substituting  the  expression  for  4(fc)  into  Eq.  (3.30)  and  integrating,  we  find 
f  or  *  =  0 


2 
It  follows  that  the  particle  is  initially  localized  in  the  region  A.\r~  —  .This  in  effect  is 

equivalent  to  the  uncertainty  relation  (3.25). 

To  obtain  the  shape  of  the  wave  packet  at  any  other  instant  of  time,  we  take  into  account 
the  equation 


In  the  nonrelativistic  approximation  (k  <  m0c/ft)t  we  then  have 


Substituting  this  expression  into  Eq.  (3.30),  we  obtain,  after  integration 


where 


Hence  we  have  for  the  probability  density 

?a  (x  -  -• 


where  t>0  = — -  is  the  velocity  of  the  center  of  mass  of  the  wave  packet.  From  this  formula 

/Wo 

it  is  seen  that  the  maximum  of  the  wave  packet,  that  is,  the  point*  =  v0t,  moves  with  the 
velocity  Vo  of  the  particle. 
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The  effective  width  of  the  wave  packet  at  time  t  is  found  to  be 


If  we   substitute  q  ^  -    into  this  formula,  the  "spreading"  time  of  the  wave  packet  is 
expressed  by  a  relationship  which  coincides  with  Eq.  (3.27). 

— -"-"^ 

Problem  3.4.|Show  that  a  damped  oscillatory  motion 

x  (t)  =  e-V  cos  u>0* 

results  in  broadening  of  the  spectral  line. 

Solution.  The  damped  oscillation  can  be  represented  in  the  form  of  a  Fourier  integral 

x  (t)  =  \  a  (w)  cos  <»£  rfo>, 

0 

where 

CO 

2  r  i         Y 

a  (ro)  —  —  \  .v  (t)  cos  (At  dt  = TTT p  • 

7t      f  J  7t      J          I      V**        "  "*0/ 

For  the  largest  amplitude,  we  obtain 


The  amplitudes  with  the  same  order  of  magnitude  correspond  to  frequencies  lying  in  the 
range 


It  is  this  equation  which  represents  the  line  broadening  since,  in  effect,  only  the 
amplitudes  corresponding  to  these  frequencies  differ  from  zero* 

Since  the  damping  coefficient  7  is  connected  with  the  mean  life  of  the  damped  oscilla- 
tions by  the  relation  A£  ~  1/7,  the  above  equation  yields  the  familiar  optical  relationship 
connecting  the  broadening  of  the  spectral  lines  and  the  mean  life  of  the  atom: 


Hence,  since  Aw  —  A£//i,we  obtain  what  is  known  as  the  fourth  uncertainty  relation 

h. 


Chapter  4 

The  Time-Independent  Schrodinger 
Wave  Equation 


Planck's  quantum  theory,  Bohr's  postulates,  and  de  Broglie's 
hypothesis  represented  very  important  steps  in  the  development  of 
the  theoretical  foundations  of  atomic  physics.  However,  they  were 
overshadowed  by  the  discovery  of  a  fundamental  differential  equa- 
tion describing  the  electron  and  accounting  for  its  wave  properties, 
and  the  construction  of  a  theory  accounting  for  the  quantum  nature 
of  radiation.  The  crucial  move  in  this  connection  was  made  by 
Schrodinger  in  1926,  when  he  proposed  a  partial  differential  equa- 
tion that  turned  out  to  be  generally  applicable  to  the  motion  of 
charged  particles  in  the  nonrelativistic  case  (v^c).  This  equation 
represented  a  generalization  of  the  classical  Hamilton-Jacobi 
equation  to  cases  in  which  the  de  Broglie  wavelength  differs  from 
zero.  The  Schrodinger  equation  stands  in  approximately  the  same 
relation  to  the  Hamilton-Jacobi  equation  as  does  wave  optics  to 
geometrical  optics. 


A.     DERIVATION  OF  THE  TIME-INDEPENDENT 
SCHRODINGER  EQUATION 

We  shall  show  how  the  Schrodinger  equation  can  be  obtained 
most  simply.  We  must  insist  that  there  can  be  no  question  of  a 
rigorous  or  general  derivation  of  this  equation,  since  it  is  not,  in 
general,  possible  to  set  up  a  new  theory  entirely  on  the  basis  of 
old  postulates.  We  shall  adopt  a  mode  of  presentation  which  con- 
sists essentially  of  a  reasonable  generalization  of  the  wave  equation 
from  classical  electrodynamics  or  optics 


to  the  case  of  de  Broglie  waves.  Here  ?  is  a  function  describing  a 
wave  disturbance  propagating  with  velocity  u.  If  the  wave  is  mono- 
chromatic, a  solution  to  Eq.  (4.1)  may  be  sought  in  the  form 

(4.2) 
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where  u>  =  2o  is  the  angular  frequency,  and  the  spatial  part  <l>(r)of 
the  wave  function  satisfies  the  equation 

In  this  equation,  we  can  use  a  single  parameter  in  place  of  the  two 
parameters  u>  and  u,  namely,  the  wavelength 

Xr=?-  (4.4) 

We  then  have 

^T*(r)-0.  (4.5) 


From  this  general  equation,  we  can  obtain  a  wave  equation  describ- 
ing the  wave  motion  of  electrons  by  substituting  the  de  Broglie 
wavelength 

X  =  A==?55.  (4.6) 

x       ' 


Using  the  law  of  conservation  of  energy 

fm0  ~^~  ^  ^  aas  ^ !SB  const» 
we  have 

-V(r)].  (4.7) 


Substituting  this  expression  into  (4.5),  we  obtain  the  time  -independent 
(or  stationary)  Schrodinger  equation 

V^(/-)+^[£-K(r)]H/-)  =  0.  (4.8) 

Once  we  have  found  the  space -dependent  part  <|>  (r)  of  the  wave  func- 
tion from  (4.8),  we  can  use  Eq.  (4.2),  which  is  valid  for  all  mono- 
chromatic waves,  to  obtain  the  complete  wave  function,  which 

depends  on  both  (spatial  and  time)  coordinates.  Substituting  «>  =  -^-, 
we  have1 


From  now  on,  we  shall  write  the  wave  functions  which  depend  on  both  position  and 
time  in  the  form  l^(0»  wh1*6  t^e  wave  functions  whose  only  argument  is  the  position  will 
be  denoted  as  (//. 


THE    TIME-INDEPENDENT    SCHRODINGER    WAVE    EQUATION  59 

For   the   complex   conjugate   function,    whose  space-dependent 
part  also  satisfies  Eq.  (4.8),  we  have 


•  £/ 


(4.9a) 


B.^ESTRICTIONS  ON  THE  WAVE  FUNCTIONS. 
EIGENVALUES  AND  EIG  EN  FUNCTIONS 

The  functions  <|>  (t)  which  describe  the  behavior  of  a  particle  may 
be  statistically  interpreted  by  means  of  the  Schrodinger  theory.  In 
particular,  the  quantity  fy*(t)ty(t)  =  ty*ty,  which  plays  the  role  of  a 
distribution  function,  represents  the  probability  density,  or  prob- 
ability of  finding  the  particle  at  any  particular  region  in  space.  If 
the  probability  density  differs  from  zero  only  in  some  arbitrarily 
large,  but  finite,  region  of  space  Q9  it  is  accurate  enough  to  say 
that  the  particle  is  located  somewhere  in  this  region.  In  other 
words,  the  probability  of  detecting  the  particle  in  the  region  £  is 
unity.  Mathematically,  this  can  be  expressed  in  the  form 

(4.10) 

In  quantum  mechanics,  relationship  (4.10)  is  called  the  normaliza- 
tion condition. 

It  is  important  to  note  that  the  region  of  nonzero  probability 
density  is  not  always  finite.  There  are  cases  where  <]>*<)>  does  not 
go  to  zero  over  all  of  space  (the  simplest  of  these  cases  is  that  of 
the  motion  of  a  free  particle,  which  we  shall  consider  below).  When 
this  happens,  $ty*tyd*x  diverges,  and  a  somewhat  different  formula- 
tion of  the  normalization  condition  must  be  given. 

We  shall  now  give  a  general  analysis  of  the  Schrodinger  wave 
equation.  The  Schrodinger  equation  (4.8)  is  a  second-order  partial 
differential  equation.  Its  solution  resembles  the  solution  of  certain 
classical  problems  of  mathematical  physics,  such  as  the  problem 
of  a  vibrating  string.  We  note,  first,  that  certain  conditions 
must  be  imposed  on  the  wave  function  <|>f  since  it  is  a  solution 
satisfying  a  second-order  Sturm-Liouville  equation.  It  must  be 
continuous  and  have  a  continuous  derivative.  Moreover,  it  must  be 
single  valued  and  finite  over  all  of  space.  Finally,  it  must  satisfy 
certain  boundary  conditions.  In  general,  solutions  wjnch^satisfjf 
these  requirements  do  not  exist  for  all  values  of  the  parameters, 
but  only  for  certain  specific  values,  which  are  known  as  eigen- 
values. In  the  case  of  the  Schrodinger  wave  equation,  the  energy 
E  is  a  parameter  of  this  sort,  its  eigenvalues  being 

£1,  Ea,  £„....  ( 
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The  solutions  of  the  wave  equation  corresponding  to  these  eigen- 
values 

*i,  fc,  *s,  ...  (4.12) 

are  said  to  be  eigenfunctions.  The  possible  values  of  the  energy 
form  the  energy  spectrum.  We  shall  see  below  that,  if  the  motion 
of  a  particle  is  not  bounded,  its  energy  spectrum  is  continuous. 
If,  however,  the  position  of  the  particle  in  space  is  bounded,  the 
energy  spectrum  is  discrete. 

Let  us  show  that  the  eigenfunction  fyn  satisfy  the  orthonormality 
condition 

lWn<Px  =  *ttn;  (4.13) 

where  <W  is  the  Kronecker-Weierstrass  symbol,  which  is  equal 
to  unity  for  n'~n  (the  normalization  condition),  and  to  zero  for 
ti  £n  (the  orthogonality  condition).  To  prove  Eq.  (4,13),  we  write 
the  Schrodinger  equations  for  tyn  and  <|)S»: 


0,  (4.8a) 

^  «•  +  -%?  (E^  -  V)  $•  =  0.  (4.  8b) 

If  we   multiply   the  first  of  these  equations  by  <|*J'f  and  the  second 
by  (~~tn)>  and  then  add  the  two  resulting  equations,  we  obtain 

^•V'tn  -  *nV  V  +  ?£°-  (En  -  En.)  ^n  =  0.  (4.  14) 

Since 

$'V*y*  —  W^*=V-B, 
where 

fl=W«v*«  —  ^vi*. 

integration  of  Eq.  (4.14)  over  all  of  space  yields 

J  V  •  *  d»JC  +  ^  (£»  -  E^)  J  W'tnd'Jf  =  0  (4.15) 

Since  the  <|>  function  tends  to  zero  at  infinity,  we  obtain2 


2The  surface  S  -  4nr2  tends  to  infinity  at  r  >  «>.  Therefore,  the  integral  goes  to  zero  at 
/•  ->  no  if  the  wave  function  l/J  tends  to  zero  more  rapidly  than  r"1.  This  condition  is  always 
satisfied  for  a  discrete  spectrum  since  the  wave  function,  as  a  rule,  approaches  exponen- 
tially zero  at  infinity.  The  case  of  the  continuous  spectrum  will  be  considered  separately. 
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Therefore,  instead  of  Eq.  (4.15),  we  have 

*,tynd*x  =  0.  (4.16) 

We  shall  now  assume  that  En-/-En'  (that  is,  n'-£n).  According  to 
Eq.  (4.16),  the  following  equation  then  holds  (orthogonality  condi- 
tion): 

i  ([^'tjjflf/'.vszsO  (4.17) 

If,  however,  n'  =  n  (or  En  =  En>),  this  integral  does  not  go  to  zero 
and  we  may  impose  the  requirement  that  it  be  equal  to  unity  (the 
normalization  condition): 

v=l.  (4.18) 

Thus  the  eigenfunctions  <J>i,  ty*  and  '[>H,  which  correspond  to  the  eigen- 
values Elt  El  andEa,  do  indeed  possess  the  property  of  orthonormality 
(4.13).  This  is  one  of  the  most  important  characteristics  of  eigen- 
functions. 

C.    A  PARTICLE  IN  A  POTENTIAL  WELL 

As  an  example  of  the  calculation  of  eigenvalues  and  eigenfunc- 
tions, we  shall  consider  the  motion  of  a  particle  in  a  potential  well. 
Since  the  chief  interest  of  this  problem  is  simply  that  it  provides 
an  illustration  of  methods  used  in  the  solution  of  this  example,  we 
may  assume  a  very  simple  dependence  of  the  potential  energy  on 
distance  (see  Fig.  4.1): 


IVQ  for   —  :o<\v<0    (region  I) 
0  for       0  <></    (region  II)  (4.19) 

VQ  for       /  <  x  <  co  (region  III) 

In  the  potential  well  (region II),  where  E>l/=0,the  Schrodinger 
equation  takes  the  form 

6n  |-£2<fn  =  0,  (4.20) 

where 

and 

(4.20a) 
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We  note  that  the  case  £<0  has  no  physical  meaning  in  this  problem. 
Since  the  general  solution  of  Eq.  (4.  20)  is  oscillatory,  we  have 


<]>n  =  BU  cos  kx  +  A  ii  sin  kx. 
In  regions  I  and  III,  the  Schrodinger  equation  has  the  form 


(4.  21) 


(4.22) 


Here  two  cases  must  be  distinguished.  In  the  first  case  (£>  V,,), 
the  solution  for  these  regions  is  also  oscillatory  in  character  (an 
equation  of  the  elliptic  type).  It  is  given  by  Eq.  (4.21),  the  value 
of  k  being 


No  restrictions  need  to  be  imposed  on  the  wave  functions  at  infinity. 
Therefore,  the  energy  E  can  assume  any  value  in  a  continuous 
spectrum  of  energies.  It  is  better,  however,  not  to  investigate  the 
case  of  a  continuous  spectrum  on  the  basis  of  this  example,  but 
rather  on  the  basis  of  the  motion  of  a  free  particle  (see  below). 
The  potential  well  only  adds  to  the  mathematical  difficulties  of  the 
problem,  without  changing  the  general  character  of  the  solution. 


Vfjc) 


• 

m 

n 

\  * 

Ujl 
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I 

Fig.  4.1.  The  motion  of  a  particle  in  a 
potential  well. 


In  the  second  case,  namely,  the  case  of  a  potential  barrier 
(£<V0),  the  solution  of  Eq.  (4.22)  is  exponential  in  character  (an 
equation  of  the  hyperbolic  type).  The  general  solution  can  be 
written  in  the  form 


<K  m  =  4  1. 


where 


,  me-* 


(4.23) 


(4.24) 
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If  the  energy  can  assume  any  value  without  restriction,  the  wave 
function  inside  the  potential  barrier  (0<£<l/0)  will  contain  both 
an  exponentially  increasing  part  and  an  exponentially  decreasing 
part  (see  Fig.  4.2).  Therefore,  we  must  choose  only  those  values 
of  E  for  which  exponentially  increasing  solutions  do  not  exist  inside 


Exponentially 

increasing 

solution 


Exponentially 

decreasing 

solution 


0  I  I 

Fig.  4.2.  Wave  function  for  a  given  value  of  E. 
The  energy  level  is  taken  to  be  the  abscissa  of 
the  wave  function 

the  potential  barrier.   Accordingly,  we  require  that  the  coefficient 
#i  =  0  in  region  I(#<0),andthecoefficient  j4iu=0in  region  III 
We  then  have 


(4.25) 


where,  for  the  sake  of  simplicity,  we  have  made 

Bin=Be«. 

By  joining  the  solutions4  at  the  boundary  of  regions  I  and  II  (x  =  0), 
and  also  at  the  boundary  of  regions  Hand  III  (#  =  /),  and  making  use 
of  the  requirement  that  the  exponentially  increasing  solution  vanish, 
we  obtain  the  equation  for  the  eigenvalues  of  the  energy  E. 

We  shall  now  further  simplify  our  problem  by  requiring  that 
VQ9  together  with  *,  go  to  infinity  (see  Fig.  4.3).  It  is  apparent  from 
Eq.  (4.25)  that  <j>i  =  <pin  ==  0,  and  therefore  the  boundary  conditions 
for  the  solution  (4.21)  inside  the  potential  well  (region  II)  take  the 
form 


and 


for  x  =  0 
for  jc  =  /. 


(4.26) 
(4.27) 


3When  E  <  V0,  the  number  of  unknown  coefficients  in  the  wave  function  is  smaller 
than  the  number  of  imposed  conditions.  Accordingly,  solutions  are  possible  only  for  certain 
values  of  E  and  a  discrete  spectrum  is  obtained. 

4In  joining  the  solutions,  we  must  match  the  actual  wave  functions  (and  also  their 
first  derivatives)  at  the  appropriate  point. 
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Applying  Eq.  (4.26)  and  (4.27)  to  the  general  solution  (4.21)  in 
region  II,  we  find  that  Bu  =  0,  and  the  eigenvalue  are  described  by 
the  equation 


sin/e/  = 


from  which 


(4.28) 


(4.29) 


where  n  =  1,  2, 3, 4, ....  We  exclude  the  value  n  —  0  from  further 
considerations,  since  the  wave  function  in  this  case  is  identically 
equal  to  zero.  It  is  not  necessary  to  consider  separately  the  nega- 
tive values  of  //,  since  the  wave  functions  for  negative  n  are  equal 


^=00 


Fig.  4.3.  Particle  in  a  potential  well  with 
infinitely  high  walls. 


to  the  wave  functions  for  positive  n,  taken  with  the  opposite  sign. 
Since  kl  =  -j~  E,  we  obtain  the  following  equation  for  the  energy 
spectrum  (the  eigenvalues): 


(4.30) 


The  wave  functions  corresponding  to  these  values  of  energy  (eigen- 
functions)  are 


(4.31) 
The  coefficient  An  can  be  found  from  the  normalization  condition 

/  / 

\  ^dx=  An  \  sin'2  vn  ~  dx  —  -^  /!?,=  !, 
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which  gives 


Substituting  the  expression  for  An  into  Eq.  (4.31),  we  finally  obtain 

.  (4.32) 


According  to  the  general  theorem  of  eigenfunctions  [see  Eq. 
(4.17)]  the  eigenfunctions  (4.32)  of  the  Schrodinger  equation  satisfy 
the  orthogonality  condition 


=  0  for  n'^n,  (4.33) 


as  can  be  readily  seen  by  performing  the  direct  integration  after 
substituting  Eq.  (4.32)  for  tyn. 

We   shall   now   write   down   a    few  specific  eigenvalues  En  and 
eigenfunctions  <!>„,  shown  in  Fig.  4.3: 


r  ,       -i         .     . 

Li  =  ~IiiF>     ti=]/7sinT'  (4.34) 

£a  =  4£lf     fc=|/~4sin2*-y,  (4.35) 

£a  =  9£lf     fc=  sin  3*.  (4'36) 


These  solutions  are  very  similar  to  the  familiar  standing-wave 
solutions  for  a  vibrating  string  with  fixed  ends.  The  case  n  =  1 
[see  (4.34)]  corresponds  to  the  fundamental  mode,  the  case  n  =  2 
[see  (4.35)],  to  the  first  harmonic,  etc. 


D.    THE  MOTION  OF   FREE  PARTICLES. 

NORMALIZATION  OF  WAVE   FUNCTIONS  IN  THE   CASE 

OF  A  CONTINUOUS  SPECTRUM 

We  shall  consider  the  motion  of  a  free  particle  (taking  the  one- 
dimensional  case  first),  when  the  Schrodinger  equation  (4.8)  has 
the  same  form  in  all  of  space  (—  OD<.V<[OO): 


(4.37) 
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where 

k*  =  ^=£.  (4.38) 

The  solution  of  this  equation  is 

<[»  =  4  *'**  +  &?  ikx.  (4.39) 


To  determine  the  physical  meaning  of  each  term  in  Eq.  (4.39),  we 
write  the  complete,  time-dependent  function 

y(t)  =  e'mty=Ae-'(°'-**}  +  Be-*(m'+**>.  (4.40) 

It  is  seen  that  the  first  term  Ae~i(wt^hx)  describes  the  motion  of  the 
particle  in  the  positive  direction  of  the  #  axis,  and  the  second  term 
/fe-ii«'+**)f  the  motion  in  the  negative  direction.  If  we  restrict  our- 
selves to  the  traveling  wave  which  is  propagated  in  the  direction  of 
positive  x,  we  have 

*t  =  Aeik\  (4.41) 

Y 
It   is  readily  seen  that  the  integral      \     ty*tydx  diverges.  Con- 

—  00 

sequently,  we  must  revise  the  method  of  normalization  [see  Eq. 
(4.10)].  There  are  two  basic  methods  of  normalizing  ^functions  of 
this  sort.  We  shall  devote  most  of  our  attention  to  one  of  them, 
which  was  proposed  by  Born.  Thus,  rather  than  imposing  a  bound- 
ary condition,  we  shall  subject  the  wave  function  to  a  periodicity 
condition 

»(*)  =  *  (*  +  *-).  (4-42) 

where    the    length    L    is    said   to   be   the  period.  It  can  be  made 
arbitrarily  large.  As  a  rule,  L  does  not  appear  in  the  final  result. 
Rewriting  (4.42)  in  the  form 


we  obtain  eikL=  1  ,  which  implies 

*=?.  (4.43) 

where  /i=0,   db  1,   it  2,   ±3  .....    From  Eqs.  (4.38)  and  (4.43),  the 
energy  levels  are 
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Since  the  <]>  function  is  periodic  in  the  interval  L,  the  normalization 
condition  becomes 


1/2 

f  ty*tydx=  I.  (4.45) 

-Zja 


-Z/2 

Substituting  the  expression  (4.41)  for  <[>,  we  have 

4  =  4=.  <4-46) 


Therefore,  the  normalized  solutions  are 


(4.47) 


The    direct    integration   shows   that  the   functions    (4.47)    are 
orthonormal.  As  can  be  readily  shown  by  direct  integration: 


L/2 

-T    \ 

-1/2 


1/2  

"'</*  = 


-1/2 

for     n'^Ai,  (4.48) 

[l       for     n'  =  n. 

Therefore,  introducing  the  artificially  defined  period,  we  can 
transform  the  continuous  energy  spectrum  into  a  discrete  one. 
If,  however,  the  length  L  is  allowed  to  go  to  infinity,  the  discrete 

spectrum  again  becomes  continuous.  Since  k=^-  =  r~  and  A/i=lf 
the  energy  difference  between  two  neighboring  levels  is 

A  r*  n"K  2.TI  2,TtTl  i  A     A  c\\ 

A£  =  —  T  =  v~r*  (4.49) 

It  immediately  follows  that  A£  —» 0  if  L  — *  oo;  that  is ,  the  energy 
spectrum  becomes  continuous. 

We  shall  now  generalize  the  free  particle  problem  to  the  three- 
dimensional  case.  The  Schrodinger  equation  can  be  written  in  the 
form 


Just  as  in  the  one-dimensional  case,  the  quantity  k2  is  given  by 
Eq.  (4.20a).  We  shall  solve  Eq.  (4.50)  by  the  method  of  separation 
of  variables;  that  is,  we  assume  a  solution  of  the  form: 
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Substituting  this  expression  into  (4.50)  and  multiplying  by 

1  1 


we  find 

<;,"(*)    ,  <!/"<»    ,  r  (2)    ,     2__n 

*  (.  )'  +  TOO  +  fW  +  k  -  °'  (4'52) 

where  the  primes  denote  the  derivatives  of  the  corresponding^ 
function  with  respect  to  its  argument.  We  note  that  an  equation  of 
this  type  is  satisfied  only  if  each  term  (fraction)  is  independent  of 
the  coordinates  and  is  equal  to  a  certain  constant.  We  thus  obtain 
the  following  equations  for  the  functions  <[>(#)»  ty(y)  and  ^(2): 


<!/'  (x)  +  w  W  =  o.  r  (y)  +  W  (y)  =  u> 

^o  <4-53) 


where 

(4.54) 


Taking  a  traveling  wave  propagating  in  some  specific  direction  as 
a  solution  for  each  of  Eqs.  (4.53),  we  get 

^(x)  =  Aeif^xt  ty(y)  =  Belll*vt  ^(z)=Celh^.  (4.55) 

The  unknown  coefficients  A  >  B  and  C  are  determined  from  the 
normalization  condition,  assuming  that  the  functions  <!>  (A;),  ty  (y)  and 
<p(z)  are  periodic  in  the  interval  L.  We  thus  obtain 

i  i 

(4.56) 


where 

and  n       n      n   —  0     -4-  1      "I-  9     -4-  1 

"i»  fh*  **j  —  u,   --1-  *>    —  ^»   -2- w,    ...   < 

In  this  case,  the  particle  has  energies  of 

/HO/.  a  /fioZ,a  '  (4,58) 

where 

(4.59) 
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Substituting  the  functions  (4.56)  into  Eq.  (4.51),  we  have 

_  3^ 
^  =  L~2ei».rt  (4.60) 

These  wave  functions  will  satisfy  the  orthonormality  condition 


The  complete,  or  time-dependent,  wave  function  can  now  be  written 
in  the  form 


-r>,  (4.62) 

where 

p  =  ftk,  E  =  J£a.  (4.63) 

We  note  that  the  energy  spectrum  is  again  continuous,  just  as 
in  the  case  of  one-dimensional  motion.  This  can  readily  be  shown 
in  the  same  way  as  before  by  finding  the  energy  difference  between 
two  adjacent  levels  and  then  letting  L  go  to  infinity. 


E.    FUNDAMENTAL  PROPERTIES  OF  6  FUNCTIONS. 

6-FUNCTION  NORMALIZATION  OF  THE  WAVE  FUNCTION 

IN  THE  CASE  OF  A  CONTINUOUS  SPECTRUM 


>i 

As  we  have  already  mentionpdfone  ol  the  fundamental  properties  of  elgenf  unctions  is 
their  orthonormality  [see  Eq.  (4,13)].  An  arbitrary  function  /-'  (.v)  with  no  "special" 
properties  can  thus  be  expanded  in  a  series  of  orthonormal  functions  belonging  to  a 
complete  set: 

»<M*)  (4.64) 

Multiplying   this   equation  by   fy*,  (.v)  dx  and  integrating   over  the  entire  range  of  X9  we 
obtain 


ft.  (.v)  F  (x)  dx  =       Cn    4,*.  (X)  <,„  (A-)  dx. 

"     ' 

Because  of  the  orthonormality  condition,  we  have  ^ 


(.v)/-'(-v)^.  (4.6S) 

In  order  to  bring  tyn  (x)  under  the  integral  sign,  we  change  here  the  variable  of  integra- 
tion .v  to  x'  and  substitute  the  expression  for  Cn  into  the  expression  (4.64): 

x'F  (.V)  *•  (*•)  •}-„  (.v).  (4>66) 
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This  expansion  is   a  generalized   Fourier  series.    We  can  obtain  the  ordinary  Fourier 
series  from  it  by  substituting  the  harmonic  functions  (4.47)  for  fyn  (x) . 
We  note   that  the   sum  over    n  in  Eq.  (4.66)  is  a  &  function: 


(4.67) 


since  it  transforms  the  function  F(x')  into  F(x\ 

In  this  respect,  this  &  function  resembles  the  Kronecker-Weierstrass  &  symbol,  which 
possesses  a  similar  property  with  respect  to  the  subscripts  n  and  /if: 

/,fn*nn'  =/n. 


It  follows  that  the  &  function  differs  from  zero  only  in  a  very  narrow  region  x'  —  x9  Just 
as  the  8  symbol  differs  from  zero  only  for  n1  —  n.  By  setting  F(JC')  =  I  in  (4.66),  we  obtain 
one  of  the  fundamental  properties  of  the  &  function: 


I  &  (jt  — *')£/*' =  1. 


(4.68) 


It  is  interesting  to  note  that  the  relations  (4.66)  and  (4.68)  are  completely  independent  of 
the  particular  set  of  orthonormal  functions  used  to  construct  the  ft  function.  In  the  simplest 

case,  the   &  function  can  be  constructed  from  the 

( 0(X-Xft(X.}  orthonormal  functions  (4.47),  which  are  used  in 

expansion  into  a  Fourier  series: 


(4.69) 


2it/i 


Introducing   a  new  variable  £  =  --  and  making 


use    of    the    relation     Aft 


n 
An  «=          (since 


An  =  1),  we  can  transform  (4.69)  to 

Fig.  4.4.  Graph  of  the  'smeared*  j  /  „ ~»\  s=  _J_  £Aft*'*  (•*  —  *'! 

5  function.  2rc 

If  we  now  let  L  go  to  infinity  (A&  — *  0),  the  last  sum  is  changed  into  an  integral: 


&  (x  -  x')  =  -A   f  dke1*  (x  - 


(4.70) 


The  range  of  integration  over  k  should  be  taken  from  —  co  to  +  oo.  We  note  that  the  & 
function  belongs  to  the  class  of  functions  that  are  known  as  "improper  functions."  This  is 
manifested  in  the  impossibility  of  direct  evaluation  of  the  integral  (4.70),  since  it  diverges 
at  x  =  jc'.  Therefore,  if  we  wish  to  obtain  a  representation  of  the  &  function,  we  have  to 
"smear  out"  somewhat  the  integrand  of  (4.70).  For  instance,  the  5  function  may  be 
"smeared  out"  as  follows: 


-  KS. 

c  —  A:')' 


(4.71) 


A  graph  of  the  smeared  5  function  is  given  in  Fig. 4.4.  At  the  limit  a-*  0,  the  function 
must  have  the  following  properties: 
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8  (x  —  x')  =  co  for   x  =  x\ 

*(x  —  x')  =  0    for   x^x\  (4.72) 


As  we  approach  the  limit  a  —  *>  0,  the  area  between  the  ft  function  and  the  x  axis  remains 
unchanged: 

J  •  (,-^-lta  I  J  d*'  ai  +  (*_x),    =1-  (4.73) 

The  three-dimensional  6  function  can  be  similarly  defined  as 

»  (r)  =  6  (x)  &  (y)  8  (*)  =  J-  j  d'*e'*r  .  (4.74) 

It  satisfies  the  following  conditions: 

8(r)  =  0      if    r^-0, 

=l.  (4.75) 


The  integrals   (4.74)  and   (4.75)  are   three-dimensional.    The  former  extends  over  the 
entire  wave-number  space  (k\  and  the  latter,  over  the  entire  coordinate  space  (r). 

It  is  very  convenient  to  express  the  density  of  a  point  charge  with  the  help  of  a  three- 
dimensional  &  function: 

P(r)  =  *5(r).  (4.76) 

Substituting  this  expression  into  Poisson's  equation 

V*  &  =  -  4^5  (r)  =  -  -  2*2  f  d»keikr  , 
we  can  determine  the  potential  of  a  point  charge  * 


Thus,  the  three-dimensional  6  function  enables  us  to  describe  the  singularity  at  the  point 
r  -*  0  of  the  Laplacian  operator  applied  to  1/r: 


In   determining  the  potential  $,    we  made  use  of  the  rule   for  dividing  an  exponential 
function  by  the  Laplacian  operator: 

alkr 
*r    _     f__ 

The  validity  of  this  rule  is  immediately  obvious,  since 

v2(v2rv*'  -  v2  *—  =  etkr 
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We  shall  not  discuss  the  properties  of  the  &  function  any  further  and  shall  proceed  to 
use  it  to  normalize  the  continuous  spectrum  for  one-dimensional  free  motion.  From 
(4.41),  the  wave  functions  for  this  case  are  of  the  form 


<|,*  (/>')  =  /*<?      n     -  (4.79) 

The  normalization  of  these  functions  is  related  to  the  6  function  in  the  following  way: 

+?    »('-4) 

'x  =  A*  \  dxe     vn      "    = 


—  GO 


=  A^nhb  (p  —/>')  —  5  (p  —  p').  (4.80) 

From  this,  we  obtain 

___       1 

Therefore, 


Let  us  compare  the  ordinary  normalization  of  wave  functions  with  the  8  function 
normalization.  We  can  write  the  orthonormality  condition  for  functions  normalized  by  the 
ordinary  method  as  follows: 


"a  ^ 

\  I 

7^  J 

' 


//»•=   J1*  if  /:  lles  inside  the  range  rii  <  n, 
n"-         V),  if  /i  lies  outside  the  range  /li-^ii 
v 


Similarly,   replacing   the   sum   in   Eq.  (4.82)  by  an  integral,  we  can  obtain  the  following 
generalized  form  for  wave  functions  which  are  normalized  with  the  help  of  a  &  function: 


'•J  • 

I*  dp*  ( 

J     ^  J 

Pi 


inside  the 
O.  if  />  lies  outside  the  range  pL  <zpt. 


Analyzing  the  solutions  that  we  have  obtained  for  a  particle  in  a  potential  well  and  for 
a  free  particle,  we  arrive  to  the  following  conclusions.  If  the  condition  V  ^>  K  is  satisfied 
at  all  points  of  space  at  infinity,  the  energy  spectrum  is  discrete.  If,  on  the  other  hand, 
there  are  regions  at  infinity  in  which  V  <.  /.',  the  energy  spectrum  is  continuous.  While 
this  conclusion  was  obtained  on  the  basis  of  solutions  for  a  rectangular  potential  well,  it 
is  quite  general  as  long  as  the  potential  energy  is  a  continuous  function  of  .v,  y  and  z. 


Chapter  5 

The  Time-Dependent  Schrodinger 
Wave  Equation 


It  has  been  shown  that  the  solution  of  the  time- independent  or 
stationary  Schrbdinger  wave  equation  (4.8)  amounts  to  a  determina- 
tion of  eigenvalues  En  (the  spectrum  of  energy  levels)  and  eigen- 
functions.  The  time-dependent  wave  function  tyn(t)  of  a  given  state 
was  found  by  multiplying  tyn  by  e~(ilh}  En'9  The  function ^,,(0  obtained 
in  this  way  describes  only  strictly  monochromatic  processes  (that 
is,  with  only  one  value  of  energy).  There  is,  however,  a  more 
general  form  of  the  Schrodinger  equation,  which  depends  explicitly 
on  time  and  may  be  used  in  a  much  larger  class  of  problems.  This 
equation  is  known  as  the  time-dependent  Schrodinger  wave  equation. 


A.^TRANSITION  TO  THE   TIME-DEPENDENT  SCHRODINGER 

WAVE  EQUATION 

s* 

To  obtain  the  time-dependent  Schrodinger  wave  equation,  it  is 
necessary  to  eliminate  the  energy  E  from  the  time-  independent 
equation,  where  it  appears  as  a  constant  parameter.  The  time- 
independent  Schrodinger  wave  equation  (4.8)  can  be  written  in  the 
form 


The  relation  that  is  used  to  eliminate  E  from  this  equation  is 

_-fiM=£HO.  (5.2) 

Accordingly,  the  time-dependent  Schrbdinger  wave  equation  is 

()-  <5-3> 


This  equation  is  more  general  than  the  previous  one.  In  particular, 
it  can  be  used  for  the  description  of  processes  in  which  the  poten- 
tial energy  V  is  a  function  of  both  position  and  time. 
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If  the  potential  energy  V  does  not  depend  on  time,  it  is  necessary 
to  solve  only  the  time- independent  Schrodinger  equation  by  finding 
all  the  possible  energy  eigenvalues  En  and  the  corresponding 
eigenfunctions  <j>n.  The  wave  function  which  satisfies  Eq.  (5.3)  is 
related  to  these  partial  solutions  by  the  linear  equation 


/-^V^.  (5.4) 

To  prove  this,  we  substitute  (5.4)  into  (5.3),  remembering  that  Cn 
are  constant  coefficients  and  that  tyn  satisfies  the  equation 

It  is  then  readily  seen  that  <KOis  a  general  solution  of  (5.3),  since 


(56) 


The  case  of  a  monochromatic  wave  is  a  special  case  of  the  general 
solution  (5.4).  The  appropriate  wave  function  can  be  obtained  from 
(5.4)  by  setting 

Crto  =  1  and  Ca  =  0  (if  n  ^  «0). 

As  we  have  presented  it,  transition  from  the  time-  independent 
equation  (5.1)  to  the  time  -dependent  equation  (5.3)  is  essentially 
equivalent  to  a  simple  replacement  of  the  energy  E  by  the  expres- 

sion ih  -5-.  In  quantum  mechanics,  this  expression  is  known  as  the 
energy  operator1 

E  =  M  jp  (5.7) 

The  effect  of  this  operator  on  any  function  amounts  to  an  ordinary 
differentiation  of  the  function  with  respect  to  time.  Thus,  E  is  a 
linear  differential  operator.  In  the  case  of  a  monochromatic  wave, 
for  which 


we  have 


Operators  will  be  denoted  by  roman  characters. 
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Hence,  it  is  seen  that  the  energy  En  is  an  eigenvalue  of  the  energy 
operator  E. 

There  are  other  operators  besides  the  energy  operator  in 
quantum  mechanics.  The  most  important  of  these  is  the  momentum 
operator 


which  takes  its  name  from  the  fact  that  in  the  case  of  a  free  particle, 
its  eigenvalue  is  identical  to  classical  momentum.  Indeed,  operating 
with  p  on  the  wave  function  of  a  free  particle  [see  Eq.  (4.62)],  we 
have 


Thus,  in  this  particular  case,  the  eigenvalue  of  the  operator  p  is 
the  classical  momentum/?. 

With  this  operator  notation,  which  helps  to  bring  out  more 
clearly  the  relationship  between  the  quantum-mechanical  and  clas- 
sical laws  of  motion,  the  Schrodinger  equation  (5.3)  has  the  form 

a  (5'9) 

Thus,  to  carry  out  the  formal  transition  from  the  classical  theory 
to  quantum  mechanics,  it  is  necessary  to  replace  the  energy  E  and 
momentum  p  in  the  classical  equation  for  the  law  of  conservation 
of  energy2 


by   the   corresponding  operators  and  to  operate  with  them  on  the 
wave    function.     The   operator  J^-  is  known  as  the  kinetic  -energy 

operator  T,  andT-f-K  as  the  Hamiltonian  operator  H.  For  the  sake 
of  brevity,  we  shall  call  the  latter  the  Hamiltonian. 


2If  the  electron  is  in  both  an  electric  field  and  a  magnetic  field,  which  is  characterized 
.by  the  vector  potential  A,  then,  using  the  classical  expression  (1.15)  for  the  Hamiltonian, 
the  time-independent  Schrodinger  equation  can  be  written  in  the  form 


E 
f 


(5.9a) 


76  NONRELATIVISTIC    QUANTUM    MECHANICS 

Using  this  operator  notation,  Eq.  (5.9)  can  be  rewritten  in  the 
form 

(E-H)'MO  =  0.  (5.10) 

For   the   time-  independent  potential     V,   the  following  relation  is 
obtained: 


and,  therefore,  the  time-independent  Schrbdinger  equation  reduces 
to  the  form 

(/•„-  1  1).  !>„  =  ().  (5.12) 

It  is  apparent  from  this  that,  m  stationary  problems,  the  eigen- 
values of  the  Hamiltonian  are  equal  to  the  eigenvalues  of  the  energy, 
just  as,  in  the  classical  case,  a  Hamiltonian  function  which  does  not 
depend  explicitly  on  time  is  equal  to  the  energy  of  the  system. 


B.  CHARGE  DENSITY  AND  CURRENT  DENSITY. 
QUANTUM  ENSEMBLES 

In   classical  electrodynamics,  an  important  role  is  played  by 
the  equation  of  continuity 

J.+  V.y=0,  (5.13) 

which  involves  the  charge  density  p  and  the  cur  rent  density  j.  This 
equation  basically  represents  a  general  form  of  the  law  of  con- 
servation of  charge.  To  show  this,  we  multiply  (5.13)  by  d\v  and 
integrate  over  all  space 

-0.  (5.14) 

Reversing  the  order  of  differentiation  and  integration  in  the  first 
integral  (this  is  allowed  since  time  here  is  only  a  parameter),  and 
changing  the  second  from  a  volume  to  a  surface  integral,  we  obtain 

-ijV*+fMS=0.  (5.15) 

If  there  are  no  charges  or  currents  at  infinity,  the  surface  integral 
vanishes,  and  we  obtain  the  law  of  conservation  of  charge 

:  =  const.  (5.16) 
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We  shall  now  use  the  wave  theory  to  find  an  expression  for  the 
charge  and  current  densities.  For  this  purpose,  we  take  the 
Schrodinger  equation  (5.3),  writing  it  in  the  form 


a  (5.17) 

Similarly,  for  the  complex  conjugate  equation,  we  have 

—or  't-^vV(0-'jI  *•  (0=0.  (5.18) 

Multiplying  Eq.  (5.17)  by  ty*(t),  and  Eq.  (5.18)  by  ^  (0,  and  adding 
the  two  equations,  we  obtain 

^.'JOiiO  _|_  2«  v  i  ^  (/)  v  r  (0  _  r  (/)  v  ,,,  (/)  j  _  o.       (5>19) 

Comparing  (5.19)  with  the  equation  of  continuity  (5.  13),  and  consider- 
ing that  the  charge  density  is  equal  to  the  charge  c  of  one  particle 
multiplied  by  the  number  of  particles  per  unit  volume  (that  is,  in 
this  case  the  probability  density),  we  have 


(5-20) 
From  (5.19)  and  (5.20),  we  find  the  current  density 


(5.21) 

It  should  be  noted  that  for  a  monochromatic  wave 


both  the  charge  density 

P^r*  (5.22) 

and  the  current  density 

^=4S-<*V**--**V*>  (5'23) 

are  independent  of  time. 

In  the  case  of  real  wave  functions  (•]>*  =  ^),  the  current  density 
is  always  identically  equal  to  zero.  For  instance,  the  current 
density  of  an  electron  in  an  infinitely  deep,  one-dimensional  potential 
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well  (see  Fig.  4.3)  is  zero  (/=0).  This  is  quite  natural,  since 
oscillations  described  by  real  wave  functions  are  actually  standing 
waves,  and  standing  waves  cannot  give  rise  to  a  particle  flux. 

The  case  of  motion  of  a  free  particle  is  somewhat  different. 
According  to  (4.60),  the  wave  function  for  this  case  describes  a 
traveling  wave: 

<p  =  L-'/«eipr/\  (5.24) 

Substituting  this  expression  for  <|>  into  (5.22)  and  (5.23),  we  obtain 
the  following  expressions  for  the  charge  density  and  the  current 
density,  respectively: 


Hence,  when  the  charge  is  distributed  with  uniform  probability 
over  the  entire  volume,  its  density  is  equal  to  the  charge  divided 
by  the  volume,  as  was  to  be  expected.  Moreover,  the  relationship 
between  the  current  density  and  the  charge  density  remains  the 
same  as  in  classical  electrodynamics. 

Let  us  now  explain  the  physical  meaning  of  the  coefficients  Cn 
which  appear  in  the  solution^  (Oof  the  time-dependent  wave  equation 
[see  (5.4)].  For  this  purpose,  we  substitute  the  expression  for  ty(t) 
into  Eq.  (5.16),  which  serves  to  define  the  conservation  of  total 
charge.  Then,  using  (5.20),  we  obtain 


Since  the  wave  functions  must  satisfy  the  orthonormality  condition 

J  ^W3*==8n«'. 

we  have 

2c;cll=2ic*i'==1-  (5>25) 

n  n 

We  can  now  give  a  physical  interpretation  to  the  coefficients  Cn  as 
quantities  characterizing  the  probability  that  a  particle  exists  in 
the  quantum  state  n.  Indeed,  if  we  know  with  certainty  that  the 
particle  is  in  the  state  /?„  (E  =  Eno),  all  of  the  coefficients  Cn,  except 
C,,0,  will  have  to  be  equal  to  zero;  that  is,  we  must  set  Cn  =  tnn<>.  If 
the  particle  has  a  nonzero  probability  of  occurring  in  two  or  more 
states,  then,  accordingly,  two  or  more  coefficients  will  differ  from 
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zero.  The  actual  probability  that  a  particular  state  is  occupied  is 
given  by  JCJ2,  while  the  probability  density  of  the  distribution  of 
the  state  over  entire  space  is  given  by  I  tyn  \*. 

When   there    is    a   large   number  N    of  particles  in  a  region  of 
space,  we  have,  instead  of  (5.25), 


Here  the  coefficients   Cn  characterize  the  distribution  of  the  total 
number  of  particles  among  the  different  quantum  states. 

In  this  connection,  we  shall  introduce  the  concept  of  quantum 
ensembles:  namely,  a  collection  of  identical  quantum  states  that 
are  described  by  the  same  wave  function  fy.  A  quantum  ensemble 
can  be  used  to  describe  an  electron  beam,  photon  flux,  and  so  on.  3 
Two  different  cases  are  possible.  f 

1)  All  of  the  particles  are  in  the  same  quantum  state  Ho.'lC/iJ'2—  N 
and  iCw|2  =  0  for  n  /  n{}. 

2)  The  particles  have  a  definite  distribution  among  the  various 
quantum  states: 

1^,1*  =  ^,     \Cnj\*  =  Ntt2,    \CHt\*  =  Nnt,...9  (5.26) 

where 


=  N. 


In  the  second  case,  the  motions  of  particles,  say,  in  the  states//  —  /*, 
and  n  =  n29  cannot  be  considered  independently  of  one  another, 
because  whenever  two  states  tyni  and'fn,  are  both  possible,  the  total 
wave  function  is  a  superposition  of  the  individual  states,  namely, 


This  is  important  in  determining  the  total  probability  of  a  state, 
which  is  proportional  to  the  product  ^*  (O'MO-  m  addition  to  terms 
such  as  W,(0^.(0»  this  product  will  contain  mixed  terms  of  the 
form  W,  (Ofi^j(0.  In  other  words,  each  particle  will  possess  a 
definite  probability  of  being  in  both  quantum  states.  The  mixed 
terms  differ  from  zero  for  coherent  y  waves  (a  pure  ensemble), 
and  this  leads  to  interference  of  the  de  Broglie  waves,  which  does 
not  happen  with  incoherent  waves  (a  mixed  ensemble).  Thus,  in  a 
pure  ensemble,  waves  are  added,  whereas  in  a  mixed  ensemble, 
intensities  are  added.  Quantum  ensembles  are  useful  with  regard 


3See   D.    I     Blokhmtscv,   Principles   of  Quantum  Mechcuiics   (trans  ),    Allyn  and   Bacon, 
1964 


80  NONRELAT1VISTIC    QUANTUM     MECHANICS 

to  the  statistical  interpretation  of  the  results  of  wave  theory.  For 
a  large  number  of  particles  or  a  large  number  of  states  occupied 
by  single  particles,  it  follows  from  the  law  of  large  numbers  that 
the  probability  of  some  particular  process,  when  calculated  on  the 
quantum-mechanical  basis,  should  agree  with  the  distribution  that 
can  be  observed  in  experiment  (see  Chapter  7).  Similarly,  in 
classical  statistical  physics,  the  probability  distribution  agrees  with 
experiment  independently  of  the  nature  of  the  *  'hidden' '  parameters. 


CVCONNECTION  BETWEEN  THE  SCHRODINGER  THEORY 
/£>AND  THE   CLASSICAL  HA  MILTON- JACOBI  EQUATION 

The  Hamilton-Jacob!  equation,  which  is  used  in  classical  me- 
chanics to  describe  the  motion  of  a  material  particle  in  a  field  of 
forcf ,  is  a  first-.ozder  nonlinear  differential  equation.  We  shall 
briefly  Yecapitulate  its  derivation,  sta>tirig~from  the  classical  law 
of  conservation  of  energy 

E=  f--\-V  =  T-\rV.  (5.27) 

2Mo     '  '  *  ' 

Let  us  introduce  the  action  function 

t  t 

S(t)=  \Xdt=  \(2T  —  E)dt  =  S  —  Ett  (5.28) 


where 

/ 
S=$2Tdt.  (5.29) 

X 

As  we  shall  show,  the  dependence  of  S  and  t  in  (5.29)  is  only  im- 
plicit, namely,  S  =  S(v(/),  tj(t),  z(t))9  and  therefore  we  have  written 
it  without  including  /  as  an  argument.  Thus,  we  can  distinguish 
between  the  stationary  action  function  S  and  the  time-dependent 
action  function  S  (/)  in  the  same  way  as  we  distinguish  between  the 
stationary  wave  function  and  the  time-dependent  wave  function. 

To  show  that  5  is  not  an  explicit  function  of  /  in  (5.29),  we  take 
the  total  differential  of  5: 

dS  =  2Tdt. 
From  the  equations 


d,,+      d,.  (5-30) 

2Tdt  =  mc  (.«'  -1-  ,/*  +  P)  dt  =  pjx  +  iyly  -j-  ptdz, 
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wefindthat     ' 


and  thus  5  does  not  depend  explicitly  on  t.  Moreover, 

p=VS.  (5.31) 

From  Eqs.  (5.31)  and  (5.27),  we  obtain  the  stationary  Hamilton- 
Jacobi  equation 

v'--£=0-  <5-32> 


We  can  introduce  the  time-dependent  action  function 


into  this  equation  by  making  use  of  the  relation  p  =  vS  =  VS(/)t 
and    eliminate    the    parameter    E    with    the   help   of  the  relation 


—        .    We  then  obtain  the  time-dependent  Hamilton-  Jacobi 
equation,  which  can  be  used  in  nonstationary  problems: 

0.  (5.33) 


The  stationary  and  time-dependent  Hamilton-  Jacobi  equations  cor- 
respond to  the  stationary  and  time-dependent  Schrodinger  equations, 
respectively. 

It    can   be    readily    shown   that    in  the  case  of  motion  of  a  free 
particle  (K  =  0,  p  =  const,  /;=  const),  the  action  function  is 

S(t)  =  -Et+pr.  (5.34) 

To  see  this,  it  is  sufficient  to  substitute  the  expression  (5.34)  for 
S(0into  (5.33).  Let  us  take  special  note  of  the  fact  that  the  function 
for  the  motion  of  a  free  particle  is  [see  (4.60)  ] 


(5.35) 

The  relationship  between  the  wave  function  and  the  action  func- 
tion will  hold  in  general  form  whenever  we  make  the  transition 
from  the  Schrodinger  equation  to  the  Hamilton-  Jacobi  equation.  We 
shall  consider  further  only  the  stationary  case,  for  which 


(5.36) 
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We  start  with  the  Schrodinger  equation  in  operator  form 

o.  <5-37> 

Making  use  of  the  momentum  operator  p  =  —  ih\j,  we  obtain  from 
(5.36) 

(5.38) 
'VS)*.  (5.39) 


We  note  from  Eq.  (5.38)  that  Eq.  (5.36)  leads  to  the  same  relation 
between  the  momentum  operator  p  and  the  action  function  S  as  in 
the  classical  theory  if  we  replace  the  momentum  operator  by  the 
classical  momentum  p.  Substituting  (5.39)  into  (5.37),  we  obtain 


/!_ 

I  2/Wo 


(5.40) 


which  is  simply  the  transformed  Schrodinger  equation. 

To  obtain  the  Hamilton-Jacobi  equation,  we  must  neglect  the 
last  term  in  (5.40),  that  is,  set  #~0.  It  is  a  well  known  fact  that 
the  quantum-mechanical  equations  transform  exactly  into  the 
classical  equations  when  #  =  ().  if,  on  the  other  hand,  h  yt  0,  but  the 
condition 


is    satisfied,    the   quantum-mechanical   terms  provide  only  small 
corrections  to  the  classical  equations.  The  approximation  corre- 
sponding to  this  case  is  known  as  the  quasi-classical  approximation. 
Since  /?  =  VS,  the  above  condition  can  be  rewritten  as 


In  particular,  for  the  one-dimensional  case,  we  have 


*.!_ 

dx  i 


dx 


*X       ^  •  (5.41) 


Thus,  the  quasi-classical  approximation  turns  out  to  be  sufficiently 
accurate  in  cases  where  the  de  Broglie  wavelength  is  constant  or 
changes  very  little  over  distances  of  the  order  of  the  wavelength. 
Since 
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we  can  rewrite  the  condition  (5.41)  in  the  form 

<1,  (5.42) 


P* 

where  F  = — ^-  is  the  classical  force  acting  on  the  particle.   From 

this,  it  follows  that  the  quasi-classical  approximation  becomes 
inapplicable  at  small  values  of  the  momentum  and,  in  particular,  at 
the  points  where  a  particle  would  come  to  a  stop  in  the  classical 
theory  (E  =  V,  p  =  Q).  Such  a  state  of  affairs  is  obtained,  for 
example,  in  a  potential  well  at  the  points  where  the  direction  of  a 
particle  is  reversed  as  a  result  of  reflection  from  the  potential 
barrier  (the  turning  points).  A  simple  explanation  can  be  given  for 
this  conclusion:  namely,  the  de  Broglie  wavelength  tends  to  infinity 
as  p—+  0  and,  when  this  happens,  the  wave-like  properties  of  a 
particle  become  too  important  for  the  particle  to  be  treated  quasi- 
classically. 


D.    THE  WENTZEL-  .APPROXIMATION 


As  mentioned  above,  Eq.  (5.40)  is  completely  equivalent  to  the 
Schrodinger  equation.  Therefore,  it  would  be  possible  to  take 
Eq.  (5.40)  as  the  basis  of  the  wave  theory  by  treating  the  term 
which  is  proportional  to  h  and  which  does  not  appear  in  the  classical 
equation  as  a  new  quantum-mechanical  potential  energy 

yquan  =_^^^  (5.43) 

which  has  to  be  added  to  the  Hamilton-  Jacobi  equation.  The  general 
solution  of  the  nonlinear  Schrodinger  equation  (5.40)  is,  however, 
much  more  complicated  than  the  solution  of  the  linear  Schrodinger 
equation,  and  therefore  the  many  attempts  to  develop  the  quantum 
theory  by  means  of  further  analysis  of  Eq.  (5.40)  met  with  failure. 
Fortunately,  Wentzel,  Kramers  and  Brillouin  succeeded  in  finding 
an  approximate  solution  of  the  Schrodinger  equation  (5.  40)  by  taking 
only  the  terms  of  the  order  of  h.  This  solution  was  found  to  be 
applicable  to  a  number  of  problems  in  quantum  mechanics.  It  is 
referred  to  as  the  WKB  approximation. 

For  the  sake  of  simplicity,  we  shall  consider  only  the  one- 
dimensional  case,  assuming  that  the  potential  energy  is  a  relatively 
smooth  function  of  x  (see  Fig.  5.1).  For  particles  with  energy  E, 
the  range  of  variation  of  x  can  be  divided  into  two  regions.  In  the 
first  region  (A'<*O),  the  energy  E  is  greater  than  the  potential 
energy  (£>V),  and  in  the  second  region  (*>#0)»  E<^V*  It  is  obvious 
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that   E=l/(je0)at  the  boundary  (x  =  x^)  of  the  two  regions.   For  the 
one-dimensional  case,  the  original  equation  (5.40)  becomes 


—  ihS"  =  2m0  (E—V)  = 


(5.44) 


0  X=XQ  X 

Fig.  5.1.  Potential  V(or)  vs    A   by  the  WKB  method 


First,  we  shall  find  the  solution  of  this  equation  for  region  I 
(£>l/),  where  the  quantity  />2>0  can  be  interpreted  as  the  square 
of  the  classical  momentum.  The  solution  will  besought  in  the  form 
of  a  series 

S  —  SQ-{-Si  -f-Sa-f-  ...,  (5.45) 

where  the  quantity  Si}  is  independent  of  //,  S{  is  proportional  to  f?t  S> 
is  proportional  to  //2,  etc.  Substituting  the  series  (5.45)  into  (5.44) 
and  neglecting  quantities  proportional  toft2  and  to  the  higher  powers 
of  //,  we  obtain4 

s,;a  -|  -  2«$;>s;  —  /fts;/  =  />*.  (5.46) 

Taking  both  sides  of  the  equation,  we  equate  terms  that  are  inde- 
pendent of  //  and,  similarly,  terms  that  are  directly  proportional 
to  //  (here  it  is  necessary  to  bear  in  mind  that  the  quantity  SL  is 
proportional  to  h).  We  thus  obtain 


(5.47) 


Hence,  it  follows  that 


(5.48) 


This  approximate  method  of  solution  is  basically  the  same  as  the  perturbation  method, 
which  is  also  successfully  used  in  the  solution  of  the  Schrodinger  equation  (see  Chap- 
ter 14  below). 
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Therefore,  retaining  only  terms  of  the  order  of  #,  we  have 

AO 

S  =  St-\-Si  =  ±  (j  pdx  +  to  \nlf~p.  (5.49) 

A 

Substituting  (5.49)   into  (5.36),  we  obtain  the  following  expression 
for  the  wave  function  in  the  first  region  (A'<\YO): 

to  AO 

^^Q  =  -^(A  cos|  J  /;</*  +  fl  sin  ±  J  /x/A  (5.50) 


In  exactly  the  same  way,  for  the  second  region  (x^>x()),  in  which 
r  <^0,  we  obtain 


where 


The  wave  functions  (5.50)  and  (5.51)  are  the  desired  approximate 
solutions.  From  these  equations  it  is  seen  that,  when  I!^>V,  the 
wave  function  is  cosinusoidal  or  sinusoidal,  as  in  the  case  of  a 
potential  well  [see  Eq.  (4.21)]  or  a  free  particle  [sec  Eq.  (4.47)], 
whereas,  whenV>>Z:,  it  changes  exponentially,  as  in  the  case  of 
transmission  through  a  potential  barrier  [see  Eq.  (4.23)]. 

Comparing  the  solutions  for  V=  const  with  the  solutions  for  the 
case  where  the  potential  energy  is  a  function  of  v,  we  see  that  the 
transition  from  one  case  to  the  other  is  simply  equivalent  to  re- 
placing the  area  of  the  rectangular  barrier  contained  between  the  x 


axis  and  an  axis  indicating  the  constant  quantity  x  =  V^^V^J^  _  \JP\ 

n  h 

by  the  area  enclosed  between  the  x  axis  and  the  curve  for  V  -=  V  (x). 
Schematically  this  transition  can  be  represented  in  the  following 
manner: 


A  similar  transition  is  made  in  the  case  of  a  potential  well. 

Thus  the  specific  form  of  the  dependence  of  the  potential  energy 
on  x  does  not  alter  the  character  of  the  solution.  Indeed  the  solution 
is  determined  only  by  the  sign  of  the  difference  between  E  and  V, 
as  we  indicated  at  the  end  of  the  preceding  chapter. 

The  solutions  (5.50)  and  (5.51)  give  a  good  approximation  only 
for  regions  that  are  relatively  far  from  the  special  point  x{]  (the 
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classical  turning  point)  where  the  quantity  p\  is  relatively  large. 
Near  the  special  point  (*-»*„)  the  quantity  p2->0;  therefore,  the 
denominator  in  Eqs.  (5.50)  and  (5.51)  vanishes  and  the  actual 
solution  diverges.  If  we  could  express  the  coefficients  C  and  Dm 
terms  of  A  and  B,  the  foregoing  approximation  would  be  entirely 
adequate  for  many  problems,  since  the  region  \x  —  JC0|  —  0  is  com- 
paratively narrow.  However,  the  relation  between  these  coefficients 
can  be  found  only  if  we  can  connect  the  functions  across  the  boundary 
(x  =  x0)  of  the  regions  (by  connection  of  the  solutions  we  mean 
matching  the  wave  functions  and  also  their  first  derivatives  at  the 
boundary  x  ->*0).  The  approximate  expression  for  <p  must,  there- 
fore, be  represented  in  such  a  form  that  at  large  pl  Eq.  (5.50)  holds, 
while  for  *=*<»  when 


p*  =  —  (x  — 
the  approximate  solution  satisfies  the  equation 


(5.53) 


At  large  z,  the  cosine  function  can  be  expressed  asymptotically 
in  terms  of  an  nth  order  Bessel  function: 


'-(*>•  <5-54> 

Therefore,  if  we  set 

*0 

z=^pdx,  (5.55) 

X 

then 

<(W0  =  ^J/f.Uz).  (5.56) 

For  large  z  (and  any  n),  because  of  the  asymptotic  formula  (5.54), 
the  solution  (5.56)  transforms  into  the  solution  (5.50)  found  by  the 
WKB  method.  Let  us  attempt  to  choose  the  order  n  of  the  Bessel 
function  in  such  a  way  that  the  solution  (5.56)  satisfies  the  Schro- 
dinger  equation  not  only  for  large  z,  when  x  --,Jt0,  but  also  near  the 
turning  point  x—  *x0,  that  is,  when 


.  (5.57) 

In  this  case  (X->XQ  —  0),  the  asymptotic  expression  for  ^  becomes 

(5.58) 
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Substituting    (5.58)    into   (5.53),    we   find   that    Jn  must  satisfy  the 
equation 


Introducing  the  new  variable  z=  ~^-(x0  —  xY^  (the  argument  of  the 

Bessel   function)      into  this  equation,  and    denoting  by  primes  the 
derivatives  with  respect  to  this  argument, 

.  =  0.  (5.59) 


If   the  Bessel  function  is  to  satisfy  Eq.  (5.59)  and  if,  at  the  same 
time,  the  wave  function  is  to  obey  the  asymptotic  equation  (5.53), 

we  must  put  /j  =  ;±~.    Thus,  instead  of  the  approximate  solution 
(5.50),  we  have 


*=y  a 


-  <5-60> 

Similarly,  in  place  of  (5.51),  we  obtain  for  the  second  region  (E  <^V) 

(|  $\P\d*)  + 

.V<> 

)},  (5.61) 

•Vfl 

where  /i/3  and   /_  i/8  are  the  Bessel  functions  of  an  imaginary  argu- 
ment. 

In  order  to  connect  the  two  solutions,  we  must  find  the  asymptotic 
forms  of  Eqs.  (5.60)  and  (5.61)  for  the  region*--  **o.  The  appro- 
priate values  of  z  andp  are  determined  from  (5.57).  For  the  Bessel 
function,  it  is  enough  to  take  only  the  first  term  of  its  expansion: 


The  solutions  (5.60)  and  (5.61)  become,  respectively, 
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Connecting  the  solutions  at  the  point  x  =  x(},  we  find 


Considering  the  asymptotic  forms  of  the  ordinary  Bessel  function 
[see  (5.54)]  and  of  the  Bessel  function  of  imaginary  argument5 


cos 


-  64> 


SThe    Bessel    function  of  imaginary   argument   is   related  to   the   Bessel   function  of  the 
first  kind  by  the  equation 


where 


The   asymptotic    form   of  the   Bessel  function  with  Jt         |rk         ran    DP  calculated  either  for 


0  f  0 


or  for  </>  0 

2 


J2     cos/x  ,     1    7^(^,1/2))       fo 
1/7*  V          2  / 


r         ^(/)'    -77,  (5  64a) 

2  2 


cosf.x  77(/j  I   V'2))    tor   -  -  --  0^   -  (S  64b) 

\          2  /  2  2 

(see,   for  example,    P.    Morse,  and  H    Feshbach,  ^f^'//JO(/s  of  Theoretical  /'/ns/rs,  New  York 
McGraw-Hill,    1953,   Vol     I,   p    622)      Therefore,   the  asymptotic  form  of  fn  for  real  ?  has  a 
discontinuity    (Stokes    phenomenon)       Indeed,    using  (5  t>4a)   and   (5  64b),    we   find   for  the 
two  cases 


( 

V 


for    6      -  F  0,  (5  64c) 

2 


V*  f  ^Z.-     Tr/   "  '       /    for    0  -    -  -  0 


(5  64d) 


Unfortunately,  many  authors,  including  Morse  and  Feshbach  (see  Methods  of  Theoretical 
P/ivsirs.  Vol  II,  p  1097),  use  (5  64c)  in  analyzing  the  passage  of  particles  through  a  po- 
tential barrier  and  thus  obtain  a  complex  asymptotic  expression  for  the  real  function  fn(^)- 

The  correct  procedure  at  </>          (where  the  function  has  a  discontinuity)  is  to  take  the  half- 

2 

sum  of  (5  64c)  and  (5  64d)  in  order  to  obtain  the  asymptotic  expression  (5  64),   which,   as 
expected,  is  real 
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and  also  taking  account  of  the  relation  between  the  coefficients,  we 
find  that  for  large   z,  the  formulas  (5.60)  and  (5.61)  take  the  form 


(5.65) 


|  f  |  dX 


Setting  B  =  A  —  — ~  in  the  last  equality,  we  find  the  first  pair  of 
connected  solutions 


(5.66) 


for  which  the  exponentially  decreasing  solution  (5.67)  in  the  region 
x>jcft  is  the  analytic  extension  of  the  sinusoidal  solution  (5.66)  for 
the  region  x  <\YO. 

To  determine  the  analytic  extension  of  the  exponentially  increas- 
ing solution  (#>#<)),  we  must  set  B  =  -  A—b.  We  then  obtain  the 
second  pair  of  connected  solutions 


=  -r^-  cos  (i  J  pdx  +  -;-)  ,  (5.68) 


(5'69) 


According  to  Eqs.  (5.66)  and  (5.68),  the  expression  for  <|>  in 
region  I  (X<^XQ)  is  of  the  form  oi  standing  waves.  It  can  also 
be  written  in  the  form  of  traveling  waves.  Indeed,  setting 


90 

we  have 
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*° 


(5.70) 


The  appearance  of  the  factor  Y p  in  the  denominator  of  (5.70)  means 
that  the  probability  of  finding  a  particie  in  a  unit  volume  (that  is,  a 
quantity  proportional  to  |'^|2)  is  smaller,  the  greater  the  velocity. 
Inside  the  barrier  the  exponental  solution  that  is  connected  to  (5.70) 
takes  the  form 


I  p  \dx 


(5.71) 


In  cases  where  the  potential  barrier  is  located  to  the  left  of  the 
turning  point,  we  must  interchange  the  limits  of  integration,  so  that 
the  lower  limit  will  always  be  less  than  the  upper  limit. 

On  the  basis  of  these  results,  we  are  able  to  quantize  the  problem  of  the  potential  well 
(that  is,  find  the  energy  levels  of  the  particles)  in  the  WKB  approximation.  Let  us  assume 
that  we  have  a  potential  well  of  arbitrary  shape,  as  illustrated  in  Fig.  5.2.  Obviously, 


zr  ~T  x=x      x 

Fig.   5.2.  Quantization    in    a    potential  well  according 
to  the  WKB  method 


the  process  of  quantization  in  the  WKB  method  will  consist  in  finding  the  conditions  under 
which  the  exponentially  increasing  solutions  on  both  sides  of  the  potential  barrier  ( A*  <  x t 
and  .v  :>  .Vj.)  vanish. 

According  to  (5.66),  the  wave  function  for  this  case  has  the  following  form  in  that 
region  of  the  potential  well  which  is  adjacent  to  the  boundary  of  the  barrier  at  A*  =  xt 
(there  being  only  an  exponentially  decreasing  solution  inside  the  barrier): 


(5.72) 
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Because  of  the  requirement  of  an  exponentially  decreasing  solution  inside  the  potential 
barrier  in  the  region  v  w  .vlt  the  following  solution  is  obtained  for  the  potential  well  in  the 
region  .v:>.Vii 

X 

(5.73) 


Two  solutions  must  be  identical  at  any  arbitrary  point  x  in  the  potential  well  (v,  <.v 
as  long  as  we  do  not  take  points  too  close  to  the  boundary  of  the  potential  barrier*  Joining 
the  solutions  (5.72)  and  (5.73)  at  some  point  t  (that  is>  matching  the  two  wave  functions 
and  also  their  derivatives),  we  have 


*'  sin 


a'  cos 


If  this  system  of  homogeneous  equations  is  to  have  a  nontrivial  solution  for  a  and  «',  the 
determinant  of  the  system  must  vanish.  We  then  obtain  the  relation 


*+  T)=°> 

X  Xi 

X*  X 

(l  J,rf*  +  i)  +  «  cos(I 


? 

Tlie  integral  \  p  dx  cannot  be  negative,  since  p  =  y  2m0  (£  —  I/)  ^  0.  Hence 


(5.74) 


Thus    the    quantization    rules   obtained   from   the    WKB   approximation  (that  ist  with  an 
accuracy  up  to  the  terms  of  the  order  of  ft)  have  the  form 


p  dx  =  2n1l    n  + 


--  V  (5.75) 


These  quantization  rules  differ  from  the  Bohr  quantization  rules  by  the  presence  of  a 
nonzero  term  -^  '2xh,  which  corresponds  to  the  lowest  state  (n  =  0).  A  more  exact  solu- 
tion of  a  similar  problem  in  wave  theory  (for  example,  the  harmonic  oscillator  problem) 
shows  that  a  zero-point  energy  necessarily  exists,  although  it  does  not  affect  the  radiation 
spectrum. 

Let  us  now  find  the  normalization  coefficient  of  the  wave  function  for  the  case  of  a 
potential  well.  In  normalizing  the  function,  we  can  restrict  ourselves  to  integration  over 
the  interval  x\  ^  x  ^  A'a  (potential  well),  since  <|>  decreases  exponentially  everywhere 
outside  this  region.  The  normalization  coefficient  can  be  then  found  from  the  equation 

:  1.  (5.76) 

Since  the  sinusoidal  function  oscillates  rapidly,  its  square  can  be  replaced  by  its  average 
value,  1/2,  without  significantly  affecting  the  accuracy.  We  then  have 

(5.77) 
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2r. 

The  oscillation  period  t  =  — '(<*>  is  the  angular  frequency)  is 

A  j  -Vj 

I*   dx 


25  =  2  - 

»  J    » 


where  0  =  ^  is  the  velocity  of  the  particle.  Hence,  for  the  normalization  coefficient,  we 

MQ 

obtain  the  expression 


Consequently,  the  eigenfunction  (5.73)  in  the  WKB  approximation  assumes  the  form 

-9—         /  1    £  ^\ 

/'O  IV  f        i       "   \  /<^  7ft\ 

nv  ™(n  ]  MV  +  -V).  (5-78> 


Pro^em  5-Ll   Determine  the  eigenfunctions  and  eigenvalues  of  the  energy  of  a  particle 
iiTa^mree-aimWsional  potential  well  bounded  by  an  infinitely  high  potential  barrier: 


»  «)  = 


oo    outside  the  potential  well. 


Find  the  conditions  under  which  different  wave  functions  correspond  to  the  same  energy 
value  /:,  that  is,  the  conditions  under  which  the  energy  levels  are  degenerate. 
Answer. 

E          _.***'! /;ii    i  rtl    i  'Jl 
'nLn»nt     '  2mQ  \  l{     '    /j    '     l\ 

/8  \r 

---      sin^—1-  sin  -  a-  -  sin  — .— 

(/ilf  /is,  /!,=  !,  2,  3,...). 

In   particular,   when  /a  =  /i,   we   have   degeneracy  for  the  case  of  the  two  different  wave 
functions fyn  ntn  and  fyn,n  n  : 

Problem  5.2.  \Find  two  classes  of  solutions  (symmetric  and  antisymmetric)  for  a 
pall'flcieTl^S^BWfw^lmensional  symmetric  potential  well: 

I/o  for       x  <:  —  /     (region  I) 

0     for   —  /  <  A-  <:  /  (region  II)  (5.79) 

l/o  for        .v  >  /        (region  III) 

Show  that,  when  l/o  can  have  any  arbitrary  value,  only  the  symmetric  solution  must 
always  have  at  least  one  energy  levei.  Find  the  condition  under  which  an  antisymmetric 
solution  is  possible.  Show  that  the  antisymmetric  solutions  corresponding  to  ,v>0  also 
represent  a  complete  solution  of  the  system  in  the  case  where  V  (x)  is  described  by 
the  function 


oo  for      x  <.  0, 
0    f  or  0  <  x  <  /, 
VQ  for      x>l. 


(5.80) 


Theso  problems  are  based  on  the  material  contained  in  both  Chapters  4  and  5 
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Plot    a    graph    of   the   potential   energy   and   of  the  symmetric  and  antisymmetric  wave 
functions. 

Solution.  The  wave  functions  in  regions  I,  II  and  III  have  the  form 


==.  Ai  sin  kx  +  83  cos  kx    (—  I  <  x  <  /), 


where 


The  exponentially  increasing  solutions  have  been  discarded  because  they  diverge  (indeed, 
this  is  the  factor  that  is  ultimately  responsible  for  the  discrete  values  of  the  energy). 

From    the    boundary    conditions    at  *  =  T/,  it  follows   that  ^  ln(,r)  =  <|/n(*)  and 

*  !«  ni^==.    i"  W  t    From  this,  we  find  that  it  is  possible  to  have  either  a  symmetric 

dx  dx 

solution,   for   which   fy(x)  =  fy( — A),   or   an   antisymmetric   solution,  for  which  fy  (x)  = 

=  -<(/(— x). 

For  a  symmetric  solution  /4a  =0,  C8  =  Di  f   and   the  energy  levels  are  found  from 

the  equation  

tan.=.y"£^.,  (5.81) 

where 


Since  ]/p8  —  as/a  >  0,  the  minimum  value  of  the  angle  a  must  lie  in  the  first  quandrant 
and  can  be  found  from  the  condition 


This  equation  will  always  have  one  root  for  any  value  of  p  in  the  region  0  <  o  <   — ,  as  is 

easily  shown  graphically  from  the  fact  that  £  =  const  >  0. 

For  the  antisymmetric  solution  [^  (x)  =  —  '}  ( —  x)]t  we  have  Bt  =  Q  and  Ct  =—  /?i. 
The  energy  levels  are  determined  from  the  equation 


tan  a  =  -- 


(5.82) 


Since  the  right-hand  side  of  (5.82)  has  a  negative  signt  the  minimum  value  of  the  angle  a 
must  now  lie  in  the  second  quadrant  and  can  be  found  from  the  condition 


tan  a  a 

Sin  a  =   —  --       _  ^rrr-^  =    —  . 

/l+tan'a       P 


Introducing  the  notation 


where  the  angle  7  lies  in  the  first  quandrant  ,  we  find 
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It  is  evident  that  this  equation  has  at  least  one  root  only  when 

P>-rr.      l.e.  t  for 

Since  the  wave  function  of  the  antisymmetric  solution  vanishes  at  A'  =  0t  the  antisymmetric 
solution  for  A:>0  is  also  the  solution  for  a  particle  In  a  potential  field  described  by 
(5.80). 

.mi  •*• 

.Problem, 5.3.\  Find  the  transmission  coefficient  D  of  a  particle  through  a  potential 
bamUl  U!  IMliaii^ular  shape  (see  Fig.  5.3),  if  the  particle  energy  /;  is  less  than  the  height 
of  the  potential  barrier  K0. 


Vffl 


Transmitted 
wave 


Fig   5.3  Transmission  of  a  particle  through  a  potential 
barrier 

Solution,  The  solution  of  the  Schrodmger  equation  for  the  various  regions  has  the  form: 

^rr-./V**  -I-  flie-1**     for    A-  <  0  (region  I) 
<h,--=/\;,f   **   1   H»e*x     for     0  <  A*  <a  (region  II) 

(region  III) 


characterize  the  incident  and 


tkl*-t»     for 

Here  k*^~m"  ',     x8=  -'"2°  (K0  --/:),  /\^I/ft  and  /? 

reflected  waves,  respectively;  /l-j?I/M  v~rt)  characterizes  the  transmitted  wave;  and 
/V  '*«V-<D^  characterizes  the  reflected  wave  coming  from  infinity.  Since  we  have  no 
reflected  wave  from  infinity  in  our  case,  we  must  set  Z/8  =  0, 

'lo  determine  the  transmission  coefficient  we  shall  use  the  boundary  conditions  at 
A*  =  fl  and  .v  —  0.  We  first  express  /13  and  B*  in  terms  of  /18,  making  use  of  the  fact  that 
xu  ;  I  : 


and  then  express  AI  In  terms  of  As 


The  transmission  (diffusion)  coefficient  D  is  then  found  to  be 
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where 


Neglecting  the  second  term  in  the  exponential  for  D(this  is  possible  because  the  quantity 
-y- p— ^-is  only  slightly  different  from  unity),  we  finally  obtain 


When  ft  =  0,  we  get  the  classical  result  D  =  0. 

r~ '        "*\ 

j  Problem  5.4.1  A  particle  is  in  a  potential  field  of  the  form 


Ion,      .v  <  0, 

0,    0^.\;<zl  (region  I) 

V0t  /<*</!  (region  II)  (5-83) 

0,       /i<*  (region  III) 

a)  Show  that  the  spectrum  is  continuous. 

b)  Show   that  for  */  >  I  and  /  (/i  —  /)  =  ra  >  1  (0  <C  E  ^~K0)  there  must  exist  quasi- 
levels,    that    is,    states    such    that   there   is    only  a  transmitted  wave  in  region  III  (see 
Problem  5.3X 

c)  Construct  a  graph  of  the  wave  function  corresponding  to  a  quasi-level. 

d)  Find    the    squared    modulus    of    the   wave   function  inside  the  potential  well  and 
explain  why  it  decays  exponentially  with  time. 

Solution.    In  the  various  regions,  the  wave  function  has  the  following  form  (see  the 
notation  of  Problem  5.3): 

^•s=sA\  sin  kx> 


From  the  boundary  conditions,  we  find 


These  relations  hold  for  any  value  of  E,  and  therefore  the  spectrum  is  continuous. 

We  obtain  quasi-levels  by  requiring  that  there  should  be  only  a  transmitted  wave  in 
the  external  region  (region  III),  that  is  to  say,  only  a  wave  moving  in  the  positive  A  direc- 
tion. Setting  /)8  =  0,  we  obtain  the  following  equation  for  the  energy  of  the  quasi-levels: 


-2 


x    /F^?  +  fa  (tana  _          «         V  (5.84) 

Y  p  _  a*  —  h  \  W*  —  **} 

It  should  be  noted  that  the  amplitude  C*  in  this  case  is  much  smaller  than  the  amplitude 
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For  <J  =  oo,  Eq,  (5.84)  becomes  exactly  the  same  as  Eq.  (5.82)  which  gives  the  energy 
levels  of  the  potential  well  (5.80X 

We  let  t:Q  denote  the  value  of  the  energy  of  a  particle  in  the  potential  well  which  we 
obtain  from  (5.82)  and,  for  present  purposes,  we  disregard  all  real  corrections  to  £0,  as 
they  make  no  essential  difference  to  the  problem.  We  then  obtain  the  following  expression 
for  the  energy: 

1 
E^£0— ~MA,  (5.85) 

where 


12/ 
and  where 


under  the  condition  */  3*  \. 

The  presence  of  an  imaginary  part  in  the  expression  (5.85)  for  the  energy  indicates 
that  the  wave  function  decays  exponentially  with  time.  The  transmission  coefficient 
through  the  potential  barrier  for  this  case  can  also  be  found  from  (5.85).  Indeed,  the 
squared  modulus  of  the  wave  function  inside  the  well  is 


and  thus  X,  which  is  called  the  decay  constant,  characterizes  the  decrease  of  the  prob- 
ability of  finding  the  particle  inside  the  potential  well.  The  quantity  X  is  related  to  O,  the 
transmission  coefficient  for  a  single  collision  of  the  particle  with  the  potential  barrier, 
by  the  equation 


where  ,y-  is  the  number  of  collisions  with  the  barrier  per  unit  time.  Hence  we  obtain  an 
expression  forD: 


rlhe  same  expression  for  D  was  obtained  in  another  way  in  Problem  5.3.  In  Chapter  6  we 
shall  obtain  it  for  the  general  case  in  which  the  potential  energy  is  an  arbitrary  function 
of  the  coordinates. 

The  constant  X  also  determines  the  duration  of  radiation   *rad~   ,   (see  also  Problem 
3.4).   Therefore,  the  quantity  A£  ^  Jl\  should  characterize  the  width  of  an  energy  level. 


Chapter   6 

Basic  Principles  of  the  Quantum  Theory 
of  Conductivity 


A.    TRANSMISSION  OF  A  PARTICLE  THROUGH  A 
POTENTIAL  BARRIER  (TUNNEL  EFFECT) 

According  to  the  classical  theory  a  particle  can  be  located  only 
at  those  points  in  space  where  the  potential  energy  V  is  less  than 
its  total  energy  £.  This  follows  because  the  kinetic  energy  of  a 
particle 

-£-=£— V  (6.1) 

2tn0  *       ' 

must  always  remain  positive.  In  a  region  where  V  >  E  (a  potential 
barrier),  the  momentum  has  an  imaginary  value  and,  classically, 
the  particle  cannot  exist  in  such  a  region.  Therefore,  if  two  regions 
of  space  in  which  E^>V  are  separated  from  one  another  by  a 
potential  barrier  inside  which  V  >  /:,  the  classical  theory  does 
not  allow  a  particle  to  penetrate  from  one  region  into  the  other. 
In^the^  wave  theory,  however,  an  imaginary  value  of  the  TQpmen- 
typ^(see  the"  WKfe  Approximation*  metFTodT*  CHlgleTj?^  simply 
corresponds  "fib  arT  exponential  <3ep0TidelTce  of  ffiejvave  functtonJJrj 
tKe^co?TrdInafes/"Since  the  WaveTufifctioh  does  not'vahisK  inside  the 
potential  barrier,  it  is  quite  possible  for  a  particle  to  leak  through 
it.  This  phenomenon  is  observed  in  the  case  of  microparticles. 
The  penetration  through  a  potential  barrier  is  called  the  tunnel 
effect.  It  is  a  specifically  quantum-mechanical  effect  and  has  no 
analog  in  classical  mechanics. 

With  the  WKB  method,  it  is  a  relatively  simple  matter  to  deter- 
mine the  probability  of  penetration  of  a  particle  through  a  potential 
barrier  of  an  arbitrary  but  sufficiently  smooth  shape  (see  Fig.  6.1). 

Let  us  assume  that  a  particle  is  moving  in  the  direction  of 
positive  x  in  the  region  I  (—  co<#<\Vi)t  where  E^>V(x).  It  en- 
counters a  potential  barrier  (*i<*<*j),  where  E<^V(x),  at  the 
point  *  =  *!,  and  then  falls  into  the  region  III  (*>*J,  where  again 
£  >  V  (x).  The  beginning  and  end  points  of  the  potential  barrier  can 
be  found  from  the  condition 

I' (*)  =  £.  (6.2) 
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The  de  Broglie  waves  corresponding  to  the  motion  of  this  particle 
will  be  partially  reflected  from  the  potential  barrier  and  partially 
transmitted  through  it.  The  transmitted  waves  will  then  propagate 
in  the  region  III  (A:^>JCJ.  To  determine  the  probability  of  penetra- 
tion of  a  particle  through  the  potential  barrier,  let  us  begin  by 


Incident  wave 


Transmitted 
wave 


Kig    (>   I     Schematic  diagram  of  a  potential  barrier 

of  an  Arbitrary    but  sufficiently  smooth  shape 

The    im  ident    and    transmitted    waves    are    repre- 

sented by  the  solid  curve,   and  the  reflected  wave 

by  the  dashed  curve 


analyzing  the  wave  in  region  III,  where  the  solution  has  the  simplest 
form,  since  in  this  region  there  can  be  no  wave  moving  in  the 
direction  of  negative  A'.  According  to  Eq.  (5.70),  the  solution  in 
region  III  has  the  form 


.... 


where 


(6.4) 


We  define  the  transmission  coefficient  as  the  absolute  value  of  the 
ratio  of  the  flux  density  of  the  particles  transmitted  through  the 
barrier  to  the  flux  density  of  the  incident  particles: 


D  — 


_|  A, 


Jmc 


(6.4a) 


To  determine  the  particle  flux,  we  shall  make  use  of  Eq.  (5.23). 
Setting  the  constants  e  and  m0  equal  to  unity  (which  we  are  allowed 
to  do  because  we  are  interested  only  in  the  ratio  of  the  particle 
fluxes),  we  obtain 
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Substituting  (6.3)  into  (6.5),  we  find 

i    =  \Bm\*     for     ''m^0' 


/in  =-1*111 14     for    8m=°  <6-7> 

It  is  seen  from  the  above  that  the  amplitude  gin  characterizes  the 
wave  propagating  in  the  direction  of  positive  x,  and  the  amplitude 
hm  the  wave  propagating  in  the  direction  of  negative  x.  Since,  as 
has  been  already  mentioned,  there  is  no  wave  propagating  in  the 
negative  x  direction  in  region  III,  we  must  set  //m  =  0.  Then 

and  the  corresponding  joined  solution  inside  the  potential  barrier 
(region  II)  can  be  found  from  Eq.  (5.71).  Setting  /j~(J  and  g  —  g\\\  in 
Eq.  (5.71),  we  find 

V*3  *J 

If  I      f 

'      J  j  Jl    J  -i  j»j-k» 

i       I  I  tC    O\ 

x  _  i  _L  ,j  p       x  (o.yi 

i    2  snr  J,  l      ' 

where 

|  p  |  =  Y2tfh(V~^E).  (6.10) 

Using  the  equation 


where 

Jf2  .Vj 

r  =  ±  J  |  p  i  rf*  ==  -J-  J    lf2Jtfy=~E~)~<tx ,  (6.12) 


the  solution  (6.9)  can  be  put  into  a  form  such  that  it  can  be  connected 
to  the  solution  in  region  I  by  means  of  Eq.  (5.70).  Thus  we  have 


'  dx 
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According   to    Eq.    (5.70),  the  solution  in  region  I  (x<^x{)  has  the 
form 


Substituting  this  solution  into  (6.5),  the  following  expressions  for 
the  incident  and  reflected  waves  are  obtained: 


/ref=—  lft|a.  (6.16) 

Equations  (5.70)  and  (5.71)  can  be  used  to  relate  the  coefficients 

with 


ffi  +  Ai  =  Tft»fi""T'  (6-1?) 

ft-Ai  =  -2/gm^.  (6.18) 

From  this,  we  find 

(6.19) 


In   accordance  with  the  definition  of  the  transmission  coefficient 
(6.4a),  we  have 

£>^M'tr    l_  I  gill  Is  _  1  (6.21) 

*- 


In  exactly  the  same  way,  we  find  the  reflection  coefficient 


(6.22) 


From  these  formulas,  it  follows  that  the  sum  of  the  transmission 
and  reflection  coefficients  is  equal  to  unity: 

=\.  (6.23) 
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For  cases  where  the  quantity  7  is  much  larger  than  unity  (these 
are  the  only  ones  of  practical  interest),  the  transmission  co- 
efficient (6.21)  is  given  by  the  expression1 


-E)  dx 


D  ^  <r2T  =  e 


(6.24) 


In  the  classical  limit  (h  —  Q),  it  is  evident  that  the  transmission 
coefficient  becomes  zero,  as  we  would  expect,  and  thus  the 
penetration  of  particles  through  the  potential  barrier  is  impossible. 


/ 
Glass 


B.  THE  TUNNEL  EFFECT  AS  A  MANIFESTATION  OF 
WAVE  PROPERTIES 

The   penetration   of  particles  through  a  potential  barrier  is  a 
typical    manifestation  of  their   wave   properties.    Therefore,    an 
analog    of  this    effect   must   occur    in 
every  type  of  wave  theory. 

In  optics,  this  analog  is  the  well- 
known  phenomenon  of  total  internal 
reflection,  which  occurs  when  light  is 
reflected  from  an  optically  less  dense 
medium.  Let  us  assume  that  a  ray  of 
light  propagating  in  glass  strikes  a 
glass-air  interface  (air  being  the  op- 
tically less  dense  medium).  (See  Fig. 
6.2.)  Then  the  wave  field,  which  can 
be  characterized  by  the  electric  or 
magnetic  field  strength,  is  described 
in  glass  (where  the  index  of  refraction 
n^>  1)  and  in  air  (n  —  1),  by  the  follow- 
ing equations,  respectively: 


Air 


Fig.    6.2.    Propagation    of    light 

from    a    more    dense    to    a    less 

dense  medium. 


'f -~(.vsm0n-|-j'c 


(6.25) 
(6.25a) 


In  solving  the  similar  problem  of  penetration  of  a  particle  through  a  rectangular  poten- 
tial  barrier  (see  Problem  5  3),  we  obtained  the  same  exponent  as  in  (6  24),  but  in  front  of 

16rc2          /            I      F       \ 
the  exponential  there  was  a  factor  — — —       In  --    */ ,  which  was  of  the  order  of 

<1  +  ,,V      \         VV0-RJ 

unity      In  the  case  of  a  smooth  barrier  (that  is,  a  barrier  exerting  a  force  which  is  continu- 
ous at  all  points),  this  factor  becomes  exactly  unity 
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Equating  these  functions2   at  the  interface  (at  the  plane y  = 
we  obtain  the  familiar  laws  of  refraction: 


i  — ft  sin  Oj; 


Substituting   these   values    into    (6.25a)    we   find  for  the  refracted 
wave 


\  ---  -  (vrt  sm 


V\ 


From  this  it  is  seen  that,  if  n^mB{<^\9  an  ordinary  wave  will  be 
propagated  in  region  II.  In  the  case  where  «smOi>  1,  we  have  total 
internal  reflection,  the  physical  analysis  of  which  can  not  be  given 
on  the  basis  of  the  laws  of  geometrical  optics.  From  the  standpoint 
of  physical  optics,  which  accounts  for  the  wave  properties  of  light, 
the  electric  and  magnetic  fields  are  exponentially  decaying: 

(1)1  .  ._  /U)[ 

y/ia  sma  Oj  —  1  y  —  uojf  -|- xn  sin  Oj 

If  in  the  case  of  total  internal  reflection  the  refracted  wave  en- 
counters a  second  glass  surface  (region  III)  (that  is,  we  have  two 
pieces  of  glass  separated  by  an  air  layer),  then  in  region  III  the  wave 

is  again  propagated  according  to  Eq. 
(6.25).  Its  amplitude,  however,  will 
be  an  exponentially  decreasing  func- 
tion of  the  width  of  the  air  layer  / . 

(6.26) 

A  diagram  of  the  tunnel  effect  in 
optics  is  shown  in  Fig.  6.3. 

Before  concluding  the  physical 
analysis  of  the  penetration  of  par- 
ticles through  a  potential  barrier, 
we  should  also  consider  the  so- 
called  "tunnel  effect  paradox."  This 
paradox  lies  in  the  fact  that,  at  first 
glance,  it  seems  as  though  the  real 
classical  particles  inside  the  poten- 
tial barrier  are  in  a  peculiar  state  characteristized  by  an  imaginary 
momentum.  However,  it  is  important  to  remember  that  in  this 
purely  quantum-mechanical  phenomenon,  it  is  only  the  probability 
of  a  particle  being  somewhere  inside  the  barrier  which  decreases 
exponentially  as  we  recede  from  the  boundary  into  the  potential 
barrier.  Inside  the  barrier,  the  momentum  and  position  of  a  particle 
are  real,  and  are  both  given  within  the  framework  of  the  ordinary 


Wave 

^transmitted 
through 
barrier 


Fig.  6.3.  Schematic  diagram  of  the 
tunnel  effect  in  optics  (total  inter- 
nal reflection). 


We  shall  not  write  here  the  expression  for  the  reflected  wave. 
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uncertainty  relation.  In  order  to  show  this,  let  us  consider  a 
potential  barrier  with  constant  V.  In  the  first  approximation,  the 
wave  function  inside  the  barrier  changes  according  to  the  equation 

?II==^-W,  (6.27) 

where 


—  E)  =  const.  (6.28) 

The  right-hand  side  of  (6.27)  can  be  represented  in  the  form  of  a 
Fourier  integral 


e-**=  \f(k)coskxdk,  (6.29) 

ff 
where 

W>=S7?TF7'  (6'29a) 

which  means  that  it  can  be  represented  as  a  set  of  wave  functions 
having  real  momenta.  Obviously,  the  amplitudes  /  (k)  will  effectively 
differ  from  zero  only  when  k  varies  in  the  range  from  0  to  x. 
Thus  the  uncertainty  in  momentum  is 

Ap  ^ftx. 

According  to  wave  mechanics,  the  position  of  a  particle  inside 
the  potential  barrier  can  be  determined  only  to  an  accuracy  within 
the  order  of  the  width  of  the  barrier: 


Multiplying  Ap  and  A*  we  obtain 


Since  our  equations  are  valid  only  for  the  case  y/  >  1  ,  the  accuracy 
in  determining  the  momentum  of  the  particle  and  its  position  inside 
the  potential  barrier  will  not  contradict  the  uncertainty  relation. 


C.    MOTION  OF  ELECTRONS  IN  A  METAL3 

The  theory  of  the  tunnel  effect  has  a  number  of  very  important 
applications  both  in  the  theory  of  metals  and  in  nuclear  physics. 
On  the  basis  of  this  theory  it  is  now  possible  to  explain  a  number 
of  phenomena  which  could  not  be  accounted  for  in  classical  physics, 

3See  H.  Bethe  and  A.  Sommerfeld,  Elektrontheorie  der  Metalle,  Handbuch  der  Physik, 
Berlin:  Springer,  1933,  Vol.  24,  part  2. 
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such  as  cold  emission  (the  emission  of  electrons  from  a  metal 
under  the  action  of  an  electric  field),  contact  potentials,  etc.  Before 
discussing  these  phenomena,  we  shall  say  a  few  words  about  the 
theory  of  an  electron  gas,  which  underlies  the  electron  theory  of 
the  conductivity  of  metals. 

The  high  conductivity  of  metals  indicates  that  electrons  are  able 
to  move  relatively  freely  inside  the  entire  crystal  lattice  of  a  metal. 
Their  excape  from  the  metal  into  vacuum  is,  however,  hindered 
because  this  requires  the  expenditure  of  a  certain  energy,  the  so- 
called  "work  function."  This  suggests  that  as  a  first  approx- 
imation, we  may  simply  consider  the  metal  as  a  potential  well, 
inside  which  the  potential  energy  of  an  electron  can  be  taken  as 
equal  to  zero  (I/  —  0),  while  outside  the  metal  (that  is,  in  a  vacuum) 


This  simplified  model  enables  us  to  explain  several  phenomena 
occurring  in  metals.  Some  of  its  fundamental  results,  obtained  for 
the  case  of  free  electrons,  can  be  extended  (with  the  help  of  quantum 
mechanics)  to  include  the  periodic  field  of  a  crystal  (see  below  the 
simplest  one-dimensional  Kronig-  Penney  model,  which  correctly 
describes  at  least  the  qualitative  aspects  of  many  phenomena). 

The  electron  gas  model  of  a  metal  was  first  considered  in 
classical  theory  (the  theories  of  Drude,  Lor  entz  and  others).  In  this 
version  of  the  model,  the  classical  Maxwell-Boltzmann  statistics, 
which  had  successfully  explained  many  phenomena  in  the  kinetic 
theory  of  gases,  was  now  applied  to  electrons.  However,  the  elec- 
tron gas  model  encountered  great  difficulties  in  developing  a 
theory  of  specific  heat.  In  accordance  with  the  theorem  of  equipar- 
tition  of  energy,  well-known  from  classical  statistical  mechanics, 
each  degree  of  freedom  must  have,  on  the  average,  an  energy4 

l£av  =J  kT,  (6.30) 

where  k  is  Boltzmann's  constant.  From  this,  it  is  evident  that  the 
contribution  of  each  tree  electron  to  the  total  specific  heat  will  be 
the  same  as  that  of  a  free  atom: 


This  contradicts  the  experimental  facts  which  indicate  that  the 
specific  heat  of  a  monatomic  metal  is  the  same  as  that  of  the  lattice 
atoms;  that  is,  in  the  first  approximation,  free  electrons  make  no 
contribution  to  the  specific  heat  of  a  metal. 


The   specific  heats  of  monatomic   substances  will   be  considered  in   greater  detail   in 


Chapter   12  [see  formula  (12.66)]. 
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This   contradiction  was  resolved  by  Sommerfeld,  who  showed 
that  electrons  in  a  metal  do  not  obey  the  classical  distribution 


Instead,    the    distribution   is    characterized   by   the  Fermi-Dirac 
distribution  function 

/F-D=-,—  -!. 


_!.  ,*/«•+, 

The  Fermi-Dirac  quantum  statistics  is  based  on  the  Pauli 
exclusion  principle,  according  to  which  each  energy  level  can  be 
occupied  by  at  most  two  electrons  (two  quantum  states  which  differ 
only  by  the  direction  of  spin). 

If  we  are  given  a  three-dimensional  potential  well  of  a  cubic 
shape,  with  side  length  equal  to  L,  then,  according  to  Eq.  (4.57), 
the  components  of  momentum  p  =  fik  will  be  related  to  the  integers 
HI,  t\i  and  rtj  characterizing  the  energy  level  by  the  expressions 


*"/."'    y     —/T>     *         L 
We  note  that  a  unit  interval  of  quantum  numbers 


toii\nAnA  =     -h>cl*p.  (6.30a) 

is  associated  with  only  one  level,  occupied  by  two  electrons.  There- 
fore, if  there  are  p0  electrons  per  unit  volume,  the  maximum 
momentum  of  an  electron  at  absolute  zero  (T  —  0)  can  be  determined 
from  the  relation 

pmax 


or 

Pmax=ft(3^p0)f/i.  (6.32) 

The  corresponding  maximum  kinetic  energy  is 


i(3«*P.)V..  (6.33) 

This  energy  is  called  the  Fermi  energy. 
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As    an   example,    let  us  compute  the  value  of  this  energy  for 
silver.    The  density  of  silver  is  10.5,  and  its  atomis  weight  is  107.9. 

Assuming  that  the  number  of  free 
electrons  is  equal  to  the  number  of 
atoms  per  unit  volume,  we  have 


Vacuum 
^max 

ll!IINillill:ll!!lllllllllll|llll| 

Vac 

^ 

uum 

-*» 

Metal 

£ 

A 

0 

£ 

Po  = 


-6.02-  10»  =  5.8-  1022 


where  we  have  used  the  Avogadro 
number  (the  number  of  atoms  in  one 
gram-atom,  equal  to  6.02  •  1021)* 
Hence,  Eq.  (6.33)  gives 

Fig.  6  4.  Model  of  a  potential  well  for  ro  o  e      IA   i>  c   o 

.  ,  £iTnaY  =  o,D  •  iu    "  QTQ==O,O  ev. 

a  metal.  max  *•"*& 

Kmax   is   the  upper  limit  of  the  filled 

levels  at  T  -  o  (the  Fermi  energy)  Since  the  work  function  for  silver 

is     W  =  3J  ev,    the   depth   of  the 

potential  well  in  silver  is  found  to  be  equal  to  9  ev.  A  schematic 
diagram  of  the  filling  of  energy  levels  in  a  metal  is  given  in  Fig.  6.4. 
The    average  energy  of  an  electron  in  a  metal  is  given  by  the 
equation 


770    _  2      f      p'_   d*p_  __  3  pn 
^av  —  "p"o     .1      2/Wo  8*  W  —  5  ^  max ' 


(6.33a) 


In  agreement  with  experiment,  it  follows  that  at  relatively  low 
temperatures  the  electron  gas  makes  no  contribution  to  the  specific 
heat,  since 


If  the  temperature  differs  from  zero,  some  of  the  electrons  will 
jump  into  higher  energy  levels.  The  distribution  of  electrons  in 
the  higher  levels  will  not  be  characterized  by  a  Maxwellian  dis- 
tribution, but  by  the  Fermi  distribution  function 


fE/kT 


+  1 


(6.34) 


At  T  =  0  (the  case  considered  above),  this  function  equals  unity  if 
£'<£roax,  and  zero  if  £>£max»  For  TV  0  the  average  energy  can 
be  obtained  from  the  expression 
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Hence,  the  contribution  of  each  electron  to  the  specific  heat  is 

'T  '  3 

*•  (6'35) 


which  vanishes  as  T->0.    At  high  temperatures,  when  the  quantity 
eE/kT  becomes  much  greater  than  unity,  the  Fermi-Dirac  distri- 
bution function  (6.34)  approaches  the  classical  Maxwell-Boltzmann 
distribution 

f  =  Ae-E/kT,  (6.36) 

which,  as  we  know,  implies  the  following  expression  for  the  average 
energy  of  a  free  electron: 

J7el  *'    AT1  /C    Q7\ 

/-»  fl  v ~?c  K 1  -  ^  D .  O  I  J 

By   comparing  this  expression  with  the  average  energy  at  low 
temperatures  [see  Eq.  (6.33a)]  and  using  the  condition 

£^>E°V  (6.38) 

it  is  possible  to  define  the  degeneracy  temperature 

rdeg~^(3~2po)v'.  (6.39) 

At  temperatures  higher  than  TdeK  (T>  Tdeg),  we  can  use  classical 
statistics  to  describe  the  behavior  of  electrons  in  the  metal.  If 
we  substitute  the  value  of  p0f  say,  for  silver,  we  obtain  rdeg  ^-  10—20 
thousand  degrees.  Thus,  at  all  temperatures  at  which  a  metal 
exists  in  the  solid  state,  the  electron  gas  has  a  certain  degree  of 
degeneracy.  This  means  that  in  discussing  the  properties  of  elec- 
trons, we  must  use  only  Fermi-Dirac  statistics,  and,  moreover, 
the  principal  term  in  the  expression  for  the  kinetic  energy  of  free 
electrons  is  independent  of  temperature. 

This  large  value  of  the  degeneracy  temperature  is  associated 
with  the  small  mass  of  the  electron  m0.  Ions  and  molecules  have 
a  mass  thousands  of  times  greater  than  the  mass  of  an  electron, 
and  therefore  classical  statistics  applies  to  them  at  ordinary 
temperatures. 


D.    REMOVAL  OF  ELECTRONS  FROM  A  METAL. 
COLD  EMISSION 

From   the   potential-well   model   of  electrons  in  a  metal  (see 
Fig.    6.4)    we   can  see  that  to  remove    an  electron  from  a  metal 
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it  is  necessary  to  impart  to  it  an  amount  of  energy  no  smaller  than 
the  work  function 

\\7 17    V  I CL    4Q\ 

w  —  Ko       ^max'  vo.tuj 

As  we  know,  in  the  external  photoelectric  effect  an  electron 
receives  an  energy  fan  from  the  absorbed  photon.  Thus,  an  electron 
can  leave  the  metal  with  a  kinetic  energy 


*  —  ft(0  — -  W 


(6.41) 


(Einstein's  equation).  It  follows  that  the  work  function  represents 
the  minimum  amount  of  energy  that  must  be  added  to  the  electron 
in  order  to  make  its  energy  greater  than  the  height  of  the  potential 
barrier. 


Metal 


Potential  energy  in  the 

absence  of  a  field 
Vacuum         / 


Potential  energy  in  the 
presence  of  a  field 


Fig     6.5.    Potential   energy   of   an   electron   in    a   metal 
in    the    absence    and    in    the   presence    of    an    external 

electric  field. 

The   dashed  line   shows   the  behavior  of  the  potential 

energy     curve    when     the     image    force    is     taken    into 

account 

In  a  metal  at  T  =  0  some  of  the  electrons  occupy  energy 
levels  lying  above  the  Fermi  level.  If  we  increase  the  kinetic 
energy  of  the  electron  gas  by  heating  the  metal,  a  certain  fraction 
of  electrons  may  acquire  an  energy  exceeding  the  height  of  the 
potential  barrier,  and  thus  a  current  will  flow  from  the  metal. 
This  phenomenon,  which  is  known  as  thermionic  emission,  is  used 
to  obtain  an  electron  beam  in  electron  tubes.  Under  the  action  of 
an  external  electrostatic  field,  this  current  may  also  arise  even  at 
lower  temperatures.  Let  us  consider  the  influence  of  an  external 
electric  field  g  applied  to  the  surface  of  a  conductor  in  the  negative 
x  direction.  The  potential  energy  for  this  case  is 

V  (x)  =  V0  —  ej&x,  (6.42) 

where  e  =  ~e(}    is    the  electron  charge  and   g  is  the  electric  field 
intensity  (see  Fig.  6.5). 
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In  addition  to  the  external  electric  field,  the  electron  experiences 
an  electric  force  called  the  image  force.  This  force  arises  because 
an  electron  with  charge  —  e«  induces  an  "  image"  charge  eQ  at  the 
surface  of  the  metal  (see  Fig.  6.6). 
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Fig.  6  6.  Image  forces:  an  electron  outside 

the  metal  experiences  the  attractive  force 

of  the  induced  charge. 

Thus  the  total  force  acting  on  the  electron  is 

/?  =  <?„§  — ^g.  (6.43) 

The  effective  potential  energy,  taking  into  account  the  image  force, 
is  of  the  form 


l/eff  =Vt-e&x-.  (6.44) 

The  quantity  V  eff  has  a  maximum  at  the  point  A:O  : 

<6-45) 


The  maximum  value  of  Veff  is  less  than  1/0  because 

VQ-V3&.  (6.46) 


Thus,  taking  into  account  the  electric  image  force  shows  that  when 
an  external  field  is  applied,  the  work  function  decreases  and  be- 
comes equal  to 

(6.47) 


The  electric  image  forces,  however,  do  not  explain  cold  emission. 
In  fact,  an  estimate  of  the  maximum  current  (with  W"  =  0)  for 
tungsten,  for  instance,  gives  the  value 

g  =  ^2^2.108v/cm,  (6.48) 

eo 

whereas  experimentally  rather  strong  current  is  obtained  with  a 
field  as  low  as  g^4  •  106  v/cm  (Millikan). 
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Thus,  within  the  framework  of  the  classical  theory  it  is 
impossible  to  explain  quantitatively  the  cold  emission  of  elec- 
trons. 

In  the  quantum  theory  of  this  phenomenon  (essentially  the  trans- 
mission of  electrons  through  a  potential  barrier),  we  limit  ourselves 
to  Eq.  (6.42)  for  the  potential  energy  and  neglect  the  electric  image 
force  since  it  does  not  significantly  affect  the  final  result.  It  can  be 
seen  from  the  graph  of  potential  energy  (see  Fig.  6.5)  that  the 
external  electric  field  produces  a  potential  barrier  of  finite  width. 
Because  of  the  tunnel  effect  an  electron  can  penetrate  this  barrier, 
the  transmission  coefficient  being  given  by 


-  •-  V2mQ       V~V  (x)  -  E  dx 

D=e  «  (6.49) 

The  integral  in  the  exponent  must  be  taken  over  the  entire  width  of 
the  barrier  from  x  =  0  to  the  point  x  =  x{  given  by  the  condition 


—  *0&*i  =  £,  that  is,    *!=  V°~E  .  (6.50) 


Then 


•*"!  xl 

J  W(d=Edx=\ 


=  V~*A  J  V*i-xdx  =  l 


Finally,  we  obtain  the  following  expression  for  the  transmission 
coefficient  D: 


D=e~  -       °      MS     =r»"f  (6.52) 

where  the  quantity  80  depends  on  the  nature  of  the  metal  and  the 
energy  of  the  free  electrons  inside  the  metal.  The  cold  emission 
current  is  proportional  to  the  transmission  coefficient 

y=/0D==yor  s~.  (6.53) 

It  follows  from  the  last  equation  that  cold  emission  should  be 
observed  for  an  electron  field  of  g~108  v/cm.  This  result  is  in 
good  agreement  with  experimental  data. 
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E.    CONTACT  POTENTIALS 

Contact  potentials,  which  were  discovered  by  Volta,  can  also  be 
explained  on  the  basis  of  the  tunnel  effect.  Let  us  consider  two 
different  metals  with  different  work  functions  and  different  Fermi 
energies  (see  Fig.  6.7).  If  these  two  metals  are  brought  into  contact, 
they  will  still  be  separated  by  a  potential  barrier  of  finite  width. 


Vacuum         Metal  I 


Vacuum 


Metal  II        Vacuum 


•a 


Fig.  6.7.  Two  metals  before  they  are  placed  in  contact  with  each 

other. 

Wj    and  W/2   are  work  functions,   EQJ   and  &Q2  are  ^e  uPPer  limits 
of  the  filled  levels  (the  Fermi  energies) 

Since  a  certain  number  of  filled  energy  levels  in  metal  I  lie  above 
the  highest  filled  level  of  metal  II,  electrons  can  move  from  metal  I 
into  the  empty  levels  of  metal  II  by  the  tunnel  effect.  From  Fig.  6.7, 
it  is  seen  that  no  flow  in  the  opposite  direction  is  possible,  since 
electrons  of  metal  II  would  then  have  energies  corresponding  to 
filled  levels  of  metal  I.  It  is  obvious  that  the  electric  current  from 
metal  I  to  II  ceases  only  when  the  uppermost  filled  levels  of  both 
metals  are  of  the  same  energy. 


Fig.    6.8.    Two    metals    after    they    have    been    brought    into 
contact.     Formation  of  the  contact  potential. 

As  a  result  of  the  tunnel  effect,  metal  II  acquires  an  excess  of 
electrons  and  is  charged  negatively,  whereas  metal  I  is  charged 
positively.  Thus,  the  energy  levels  of  metal  II  are  shifted  upwards 
relative  to  those  of  metal  I  (see  Fig.  6.8).  After  the  Fermi  levels 
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in  both  metals  are  equalized,  the  electric  current  ceases,  but  then 
there  arises  a  potential  difference  proportional  to  the  difference 
between  the  work  functions  of  the  rnetals: 

A(D  =  ^&i  —  <&o= — 5-H — l  (6.54) 

J  a  f,  *  *  * 


This  quantity  is  called  the  contact  potential. 


F.    THE  MOTION  OF  ELECTRONS  IN  A 
PERIODIC  ELECTRIC  FIELD  (THE  ONE-DIMENSIONAL 
KRONIG-PENNEY  MODEL) 

As  has  been  previously  mentioned,  the  representation  of  the 
motion  of  an  electron  in  a  metal  in  terms  of  the  potential  well 
model  is  an  approximation  in  which  we  average  out  the  periodic 
potential  of  the  lattice.  A  number  of  characteristic  features  of  the 
motion  of  electrons  in  a  crystal  appear  only  when  the  periodic 
variation  of  the  potential  is  taken  into  account.  In  the  general  case, 
the  solution  of  the  problem  is  very  complicated.  In  order  to  deter- 
mine some  of  the  qualitative  features  of  this  motion,  we  may  con- 
sider, however,  a  simplified  model  of  a  crystal. 

In  the  one-dimensional  Kronig-Penney  model,  the  periodic 
electric  field  produced  by  the  positive  ions  of  the  crystal  is  approx- 
imated by  a  periodic  square-well  potential  of  the  form  shown  in 
Fig.  6.9.  The  width  of  each  well  is  denoted  by  a,  and  the  width  of 
the  barrier  between  two  successive  wells  by  b.  Thus,  the  period  of 
the  potential  (the  equivalent  of  the  lattice  constant)  is  c  =  a-\-b. 
The  barrier  height  is  set  equal  to  K0. 


/?=/ 


n=3 


Fig     6.9.    One-dimensional    Kronig-Penney   model    of   a 
crystal. 

The   solution   of  Schrodinger's  equation  for  the  nth  section  of 
the  periodic  potential  has  the  following  form: 
for  the  potential  well: 


=  An  sin  kxn  +  Bn  cos  kxn, 


(6.55) 


for  the  potential  barrier: 

<[>;  =  An  sinh  x  (xfl  —  c)  +  B'n  cosh  *  (*,t  —  c).  (6.56) 
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Here  k  =  j--  ,  *  =  —-  •  *«  =  *  —  */i  ;  the  coordinate  *n 

is  measured  from  the  origin  of  the  /zth  section  (that  is,  the  nth  well). 
Similarly,  for  the  (rc-f-  l)st  section,  we  can  write 

*n+i  =  »«,isinfaf/1+1  +  fl/1+1cosft^1.    -  (6.57) 

We  first  join  solutions  (6.55)  and  (6.56)  at  the  point  x  =  en  -+-  a   (that 
is,  &txn  =  a),  obtaining 


AnBiuka-\-Bncoaka  =  —  ^sinhxft  +  Bicoshxft,   >•"      (6.57a) 


Next,  joining  the  solutions   (6.56)  and  (6.57)  at  the  point  x  =  c  (n  -f-  1) 
and  noting  that  xn  =  c  and  ^,=0,  we  obtain 


Let  us  substitute  (6.58)  into  (6.57a)  and  simplify  the  problems  by 
considering  the  limiting  Kronig-  Penney  case  in  which  the  width  of 
the  barrier  between  two  wells  tends  to  zero  (b->0),  the  height  Ku 
tends  to  infinity,  and  the  width  of  the  well  remains  constant: 


—  - 
Then,  since  cosh  t.b->  1  and  sinh  xb->x&,  we  have 

An  sin  ka  +  Bn  cos  ka  =  Bn  ,  lf  59 

An  cos  ka  —  Bn  sin  /ca  =  /1,M1  —  -^  5/Ml, 

Equations  (6.59)  are  linear  difference  equations;  their  solutions 
should  be  sought  in  the  form 

Aa  =  dr*9     Bn  =  C/\  (6.60) 

Substituting  (6.60)  into  (6.59)  and  dividing  both  equations  by  rn,  we 
obtain  an  equation  from  which  we  can  determine  the  quantity  r  and 
the  relationship  between  the  coefficients  C{  and  C>: 

Q  sin  ka  =  C2  (r  —  cos  ka), 
C,  (?    r  —  sin  ka}  =  C,  (r  —  cos  ka).  <6"  61) 
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Multiplying  these  equations  together  and  dividing  both  sides  of  the 
resulting  equation  by  C^C^  we  obtain  an  equation  for  r: 

r*  —  2r  cos  k'a  +  1  =  0,  (6.  62) 

where  cos  k'a  is  given  by  the  equation 

cos  k'a  =  ~  sin  ka  +  cos  ka.  (6.  63) 

As  we  shall  see  below,  (6.63)  is  the  fundamental  equation  for  the 
energy  levels  in  the  periodic  field  of  a  crystal. 
The  solution  of  (6.62)  has  the  form 

r  =  cos  k'a  ±  i  sin  k'a  =  e±  ik'a  .  (6.  64) 

We  note  that  if  the  right-hand  side  of  (6.63)  is  greater  than  unity, 
//will  be  imaginary,  and  in  this  case  we  get  an  exponential  solution 

r  =  e±lk>\a==e±*'a  (6.65) 

Let  us  examine  in  greater  detail  the  solution  for  the  case  of  real 
values  of  k' 

r  —  eik'a. 
From  Eqs.  (6.61)  and  (6.60),  we  have 

C.—  c/-008** 

01       °°     smka      • 


Substituting  the  values  of  ,4,,  and  £Ainto  (6.65)  and  using  the  fact  that 
xn~x  —  an  when  b->Q,  we  find  an  expression  for  the  wave  function 
in  the  crystal: 

y^C^'Uv  (6.66) 

where  Un  is  a  function  with  the  same  periodicity  as  the  crystal 
lattice 

~  *'**'*  sin  ^  -  e~  w**  sin  *  <*«  ""  a)]-       <6-  67) 

In  particular,  if  P  =  0  (that  is,  there  is  no  barrier),  we  find  from 
(6.63)  that£'  =  /e;  in  this  case  the  function  £/n  becomes  unity. 

It  follows  from  (6.66)  that  an  electron  can  move  freely  in  the 
crystal  if  K  is  a  real  quantity,  that  is,  if  the  right-hand  side  of 
(6.63)  is  less  than  unity. 
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G.    BASIC  PRINCIPLES  OF  THE  ELECTRON  THEORY 
OF  CONDUCTIVITY  OF  CRYSTALS 

The  quantum  theory  of  electron  motion  in  a  crystal  lattice  provides  a  key  to  distin- 
guishing between  conductors,  dielectrics  or  insulators,  and  semiconductors  (which  in  a 
sense  form  an  intermediate  class  of  solids).  We  do  not  intend  to  treat  this  subject 
quantitatively  and  shall  confine  ourselves  to  a  few  qualitative  remarks  based  on  the  one- 
dimensional  Kronig- Penney  model. 

As  the  starting  point  of  our  analysis,  we  shall  use  Eq.  (6.63)  to  determine  the  possible 
values  of  the  electron  energy  in  a  crystal  lattice.  This  equation  is 

p 

cos  k'a  =  -r—  sin  ka  +  co«?  ka,  (6.63) 

where  a  is  the  lattice  constant,  and  the  quantity  k  =  —  -~ —  gives  the  electron  energy. 

The  eigenvalues  of  the  electron  energy  can  be  found  from  the  condition  that  #  must  be 
real.  This  condition  means  that  the  right-hand  side  of  Eq.  (6.63)  must  be  less  than  unity. 
Setting  P  =  oo,  we  obtain  the  energy  spectrum  of  isolated  atoms  (in  this  case  the  atoms 
are  separated  from  one  another  by  an  impenetrable  barrier).  The  energy  spectrum  will 
then  consist  of  a  separate  set  of  levels  for  each  well: 

_  n*W  (6.68) 

L>n 


where  n  =  1 ,  2,  3  ....  We  shall  not  consider  the  negative  values  of  //,  since  they  give 
exactly  the  same  values  of  energy,  but  correspond  to  the  motion  of  electrons  in  the 
negative  x  direction.  The  first  two  levels  (n  =  1,  2) 

£'=£?>  E*=*E*  <6-69> 

of  isolated  atoms  are  shown  in  Pig.  6.10. 


£ 

Energy  levels  of 


isolated  atoms 


Energy  levels  of  the 
crystal  lattice 


Fig.  6.  10.  The  formation  of  energy  bands  in  a  crystal  lattice. 

If  P  is  finite  [see  Eq.  (6.63)],  it  is  easiest  to  determine  the  energy  levels  graphically 
(see  Fig.  6.1  IX  The  allowed  values  of  k  (and  therefore  of  the  energy)  correspond  to  the 
values  of  the  right-hand  side  of  Eq.  (6.63)  lying  between  —  land  -\-  1.  In  Fig.  6.11,  these 
allowed  values  are  denoted  by  a  heavy  line.  Thus,  in  a  crystal  lattice  containing  N  atoms 
each  energy  level  of  an  isolated  atom  is  split  into  N  levels.  Each  of  these  groups  of 
levels  is  called  a  band  (see  Fig.  6.  10). 
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cos  ka 


The  electrons  in  the  crystal  tend  to  occupy  the  lowest  energy  levels  and,  according  to 
the  Pauli  exclusion  principle,  each  level  is  occupied  by  at  most  two  electrons  with 
opposite  spins. 

For  example,  the  crystal  lattice  of  an  alkali  metal  contains  N  valence  electrons  if 
there  are  N  atoms.  The  electrons  occupy  only  half  of  the  lower  band  (since  there  can  be  at 

most  two  electrons  in  each  level).  In 
the  ground  state,  half  of  the  electrons 
move  in  one  direction,  and  the  other 
half  in  the  opposite  direction;  con- 
sequently, the  average  current  is  equal 
to  zero.  When  an  electric  field  is 
applied,  more  than  half  of  the  electrons 
move  in  some  preferred  direction, 
thus  producing  an  electric  current  and 
moving  up  into  higher  energy  levels. 
Therefore,  in  a  metal,  either  the  first 
band  must  contain  a  sufficient  number 
of  unfilled  levels,  or  it  must  come  into 
contact  with  a  second  empty  band,  called 
the  conduction  blind* 

On  the  contrary,  in  a  perfect  insula- 
tor all  levels  in  the  first  or  valence 


Fig.  6.11.   Determination  of  possible  energy 

levels  in  a  crystal  lattice. 
The  heavy  lines  on  the  abscissa  axis  indi- 
cate    the     allowed     values     of    ka    (allowed 
bands). 


band  are  occupied  and  the  levels  of  the  second  (conduction)  band  are  all  empty.  The  energy 
spacing  between  these  bands  is  usually  several  electron  volts.  It  is  easy  to  show  that, 
when  a  field  is  applied,  the  electrons  cannot  acquire  a  preferred  direction  of  motion.  The 
direction  of  motion  of  an  electron  can  be  reversed  only  if  the  electron  goes  into  a  different 
energy  state.  But  since  all  the  energy  levels  are  occupied,  this  can  only  happen  if  the 
electron  occupying  that  other  energy  state  makes  the  opposite  transition  to  the  state 
originally  occupied  by  the  first  electron.  Therefore,  on  the  average,  there  can  be  no 
preferred  direction  of  motion  of  the  electrons  even  when  a  field  is  applied.  A  diagram 
of  the  energy  levels  of  a  conductor  and  a  dielectric  is  given  in  Fig.  6. 12. 


Metal 


Perfect  dielectric 


1 


Fig.  6.12.  Energy  level  diagram  of  a  metal  and  a  dielectric 


Nevertheless,  every  dielectric  possesses  some  (very  small)  conductivity.  The  con- 
ductivity is  much  greater  in  semiconductors,  where  the  forbidden  band  is  considerably 
smaller  than  in  dielectrics;  it  is  of  the  order  of  1  ev,  and  sometimes  even  less  (for 
example,  in  germanium,  the  width  of  the  allowed  band  is  0.66  ev  at  7'  =  300'KX  At  absolute 
zero,  semiconductors  behave  like  dielectrics;  however,  their  conductivity  increases  with 
rising  temperature,  particularly  in  the  presence  of  impurities.  At  room  temperature, 
the  resistivity  of  semiconductors  is  found  to  be  of  the  order  of  10~2 — 102  ohn>cm.  At  the 
same  time,  the  resistivity  of  dielectrics  lies  within  the  range  of  103— 1018ohnrcm,  and 
that  of  metals  in  the  range  of  10~B— 10~fl  ohnrcm. 

The  conductivity  of  dielectrics  and  semiconductor  scan  be  of  two  types.  Let  us  assume 
that  under  the  influence  of  thermal  excitation  or  the  internal  photoelectric  effect  (absorb- 
tion  of  light  by  electrons),  some  of  the  electrons  jump  from  the  valence  into  the  conduction 
band,  leaving  vacant  states  ("holes")  in  the  valence  band.  The  electrons  that  have  jumped 
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into  the  conduction  band  become  carriers  of  electric  charge,  thus  producing  a  current. 
On  the  other  hand,  as  soon  as  several  states  become  empty  in  the  valence  band,  an  elec- 
tric current  can  also  be  produced  due  to  the  motion  of  electrons  in  the  valence  band 
itself.  It  can  be  shown  that  the  motion  of  a  system  of  electrons  in  an  almost  completely 
filled  valence  band  can  be  treated  in  terms  of  the  motion  of  a  set  of  vacant  states  or 
"holes."  Obviously,  the  holes  move  in  a  direction  opposite  to  the  direction  of  electron 
motion,  so  that  they  behave  like  positively  charged  particles.  Thus,  there  are  two  ways 
in  which  charge  can  be  transported  and, therefore, two  types  of  electric  current  in  solids: 
n  type  (  due  to  the  motion  of  electrons)  and  p  type  (due  to  the  motion  of  holes). 

With  the  help  of  the  band  theory,  we  can  easily  give  a  qualitative  explanation  of  a 
number  of  interesting  phenomena. 

For  example,  the  conductivity  of  a  metal  increases  with  decreasing  temperature 
because  its  resistance  is  due  to  the  interaction  between  free  electrons  and  the  vibra- 
tions of  the  lattice.  Since  the  vibrations  of  the  lattice  diminish  as  the  temperature 
decreases,  the  resistance  will  also  decrease.  On  the  contrary,  the  conductivity  of  semi- 
conductors decreases  as  the  temperature  is  lowered,  because  the  number  of  electrons 
in  the  conduction  band  becomes  increasingly  smaller.  The  existence  of  P  type  conductivity 
in  semiconductors  has  been  demonstrated  in  investigations  of  the  Hall  effect  and  thermo- 
electric emf.  The  sign  of  the  potentials  appearing  in  these  phenomena  is  determined  by 
the  sign  of  the  current  carriers.  Experiment  shows  that  in  some  semiconductors  the 
sign  of  the  potentials  corresponds  to  electron  carriers,  whereas  in  other  semiconductors 
the  sign  of  the  potentials  is  reversed;  that  is,  the  current  carriers  act  as  if  they  were 
positively  charged  particles.  A  natural  explanation  for  this  is  provided  by  the  concept 
of  "holes." 

The  above  conclusions  in  regard  to  the  conductivity  due  to  transitions  of  electrons 
from  the  valence  band  into  the  conduction  band  (the  long  arrow  on  the  left-hand  side  of 
Fig.  6.13)  and  due  to  the  motion  of  holes  in  the  valence  band  are  based  on  the  assumption 
of  a  perfect  crystal.  This  type  of  conductivity  is  known  as  intrinsic  conductivity. 

Another  type  of  conductivity,  known  as  impurity  conductivity,  also  plays  a  significant 
role  in  semiconductors.  It  is  caused  by  the  presence  of  foreign  impurities  or  other 
structural  defects  in  the  lattice.  Such  disruptions  of  the  perfect  periodicity  of  the  lattice 
lead  to  local  deviations  of  the  field  from  a  perfectly 
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periodic  one.  As  a  result,  discrete  levels  may 
appear  in  the  forbidden  band  of  the  energy  spec- 
trum of  the  electrons  (see  Fig.  6.13).  In  practice, 
the  wave  functions  associated  with  these  levels 
differ  from  zero  only  in  a  certain  region  near  the 
given  defect.  Thus,  discrete  levels  are  sometimes 
called  impurity  levels.  They  do  not  themselves 
contribute  to  the  current  (the  electrons  occupying 
them  are  not  free).  However,  they  may  affect  the 
number  of  electrons  contained  in  both  the  con- 
duction and  the  valence  bands. 

Impurity  levels  (denoted  by  short  arrows  on  the 
right-hand  side  of  Fig.  6. 13)  can  be  divided  into  two 
types:  donor  levels  and  acceptor  levels.  A  donor 
is  capable  of  supplying  electrons  to  the  conduction         Kig-  6.13.  Diagram  of  energy  bands 
band,  so  that  free  electrons  will  appear  in  the  m  a  semiconductor, 

conduction    band.     In    contrast  to  a  donor,  an 

acceptor   absorbs   electrons;   as  a  result  a  "hole"  is  formed  in  the  filled  band  so  that 
electrons  in  this  band  are  able  to  Jump  into  higher  energy  states  (ptype  conductivity X 

If  the  impurity  levels  are  located  sufficiently  close  to  the  edge  of  the  corresponding 
bands,  lonization  (or  other  effects)  can  arise  fairly  easily  due  to  the  energy  of  thermal 
motion  of  the  lattice.  Therefore,  an  appreciable  number  of  free  electrons  (or  holes)  can 
exist  in  bands  even  under  conditions  when  direct  excitation  of  electrons  from  the  valence 
band  into  the  conduction  band  is  highly  improbable. 

In  the  case  of  impurity  conduction,  the  magnitude  and  type  of  the  conductivity  are 
determined  mainly  by  the  nature  and  concentration  of  the  impurities.  By  varying  the 
impurities,  it  is  possible  to  control  within  a  wide  range  the  magnitude  of  the  conductivity 
as  well  as  its  type  ( n  type  or  p  type).5  This  fact  is  widely  utilized  in  semiconductor 


Pure  germanium  has  a  high  resistance  but  the  addition  of  a  small  amount  of  impurities 
may  give  it  either  n  or  p  type  conductivity 
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electronics.  It  constitutes  the  basis  of  operation  of  modern  high-quality  crystal  diodes 
and  transistor  devices  which  are  capable  of  rectifying  AC  currents,  as  well  as  of  ampli- 
fying and  generating  electrical  oscillations. 

In  conclusion,  we  would  like  to  draw  attention  to  two  important  approximations  which 
we  have  implicitly  assumed  in  the  electron  theory  of  solids.  First,  in  developing  the 
theory  of  metals  and  semiconductors,  we  were  actually  concerned  only  with  the  problem 
of  single  electrons.  Accordingly,  this  theory  is  called  the  single-electron  theory.  The 
collective  properties  of  electrons  in  solids  were  taken  into  account  only  in  connection  with 
the  filling  of  energy  levels  by  free  electrons,  when  we  calculated  certain  statistical 
quantities  (the  Fermi  energy,  specific  heat,  etc.).  Even  in  those  cases  we  completely 
neglected  interactions  between  electrons,  although  they  do  in  fact  experience  mutual 
Coulomb  repulsions.  It  is  easy  to  show  that,  for  the  electron  densities  which  we  observed 
in  metals,  the  average  energy  of  this  repulsion  is  not  necessarily  small  in  comparison 
with  the  Fermi  energy.  There  naturally  arises  the  question  of  the  extent  to  which  this 
crude  single-electron  model  represents  the  properties  of  a  real  metal.  The  answer  to 
this  question  is  given  in  the  many-electron  theory  of  solids,  which  we  cannot  consider 
here  since  this  recently  developed  theory  is  a  highly  specialized  subject.  We  may  note 
that  the  fundamental  notion  of  the  electron  gas  as  a  free  carrier  obeying  Fermi-Dirac 
statistics  is  retained  in  the  multielectron  theory.  Accordingly,  the  laws  of  purely 
statistical  character  obtained  in  the  single-electron  theory  still  hold  true,  such  as,  for 
example,  the  linear  dependence  of  the  electronic  specific  heat  on  temperature.  At  most, 
certain  numerical  coefficients  will  have  to  be  slightly  changed  because  of  the  specific 
form  of  the  wave  functions  for  a  system  of  electrons.  These  refinements,  however, 
involve  exceptionally  large  computational  difficulties. 

On  the  other  hand,  in  dynamical  problems,  where  it  is  essential  to  take  into  account 
the  interactions  between  electrons,  the  simplied  approach  used  above  (the  single-electron 
theory)  is  inadequate.  One  case  of  this  type  is  the  problem  of  the  strength  of  a  metal. 

The  second  approximation  which  is  implicit  in  the  single-electron  theory  concerns  the 
crystal  lattice,  which  is  regarded  simply  as  the  source  of  a  certain  statistical  field.  The 
situation  is  actually  more  complicated  because  the  lattice  ions  execute  a  vibrational 
motion,  which,  as  we  know,  persists  even  at  absolute  zero  (zero-point  vibrations).  In 
order  to  take  this  fact  into  account,  it  is  necessary  to  consider  the  crystal  lattice  as  a 
quantum-mechanical  system  rather  than  simply  as  the  source  of  a  field  (see  Chapter  12,  D). 
The  essential  point  in  this  connection  is  that  the  lattice  is  not  isolated  from  the  system 
of  electrons,  but  coupled  to  it  since  the  electrons  interact  with  the  lattice  ions.  The  energy 
of  this  interaction  can  be  uniquely  represented  by  the  sum  of  two  terms.  One  of  them 
represents  the  potential  energy  of  interaction  between  the  electrons  and  the  stationary 
lattice.  The  motion  of  electrons  in  a  field  of  this  type  was  considered  above  in  the  simple 
case  of  the  Kronig- Penney  model.  The  second  term  is  connected  with  the  deviations  of 
the  ions  from  their  equilibrium  positions  and  represents  the  interaction  energy  between 
the  electrons  and  the  lattice  vibrations.  The  average  value  of  this  interaction  energy  is 
extremely  small  in  comparison  with  the  energy  of  the  electrons  at  the  Fermi  level, 
because  the  amplitudes  of  the  vibrations  of  the  lattice  ions  are  small  (except  at  tempera- 
tures near  the  melting  point). 

The  vibrations  of  the  lattice,  however,  may  play  an  important  role  at  low  temperatures, 
when  the  interaction  of  each  electron  with  the  vibrations  of  the  lattice  gives  rise  to  an 
additional  interaction  in  a  pair  of  electrons.  If  these  two  electrons  have  opposite  spins, 
the  interaction  results  in  an  attraction  (in  contrast  to  electrostatic  repulsion).  As  a 
result,  the  pair  of  electrons  begins  to  move  with  a  certain  degree  of  correlation.  Such  a 
system  is  found  to  have  the  property  of  superconductivity.6  The  smallness  of  the  inter- 
action which  causes  the  correlation  explains  why  superconductivity  is  observed  only  at 
extremely  low  temperatures.7 


A  theory  of  superconductivity  based  on  these  concepts  was  developed  independently 
by  the  American  physicists  Bardeen,  Cooper,  and  Schrieffer,  and  by  the  Russian  physicist 
Bogoiyubov  [See  N  Bogolyubov,  V  Tolmachev,  D.  Shirkov,  A  New  Method  in  the  Theory 
of  Superconductivity  (trans  ),  New  York:  Consultants  Bureau,  1959  1 

7See  R    Peierls,  Quantum  Theory  of  Solids,  Oxford:  Clarendon  Press,   1955;  W    Shock- 
ley,  Electrons  and  Holes  in  Semiconductors,  New  York:  Van  Nostrand,  1950 


Chapt 


er 


Statistical  Interpretation  of  Quantum 
Mechanics 

s~ 

A/ELEMENTS  OF  THE  THEORY  OF  LINEAR 
OPERATORS 


In  our  general  investigation  of  the  Schrodinger  wave  equation 
(Chapter  5),  we  saw  that  in  quantum  mechanics  the  momentum 
operator  p  =  —  i#y  is  associated  with  the  classical  momentum  p  of 

the  particle,  the  energy  operator  E  =  ih-^-  with  the  energy  E,  the 

Hamiltonian  operator  or  simply  Hamiltonian  H  with  Hamilton's 
function  H,  etc.  Before  analyzing  the  physical  meaning  of  the 
quantities  represented  by  operators  in  quantum  theory,  let  us 
consider  certain  general  aspects  of  the  theory  of  operators. 

In  the  same  way  as  a  function  relates  a  number  x  with  another 
number  y  =  f(x)9  an  operator  M  associates  one  function/^)  with 
another  function 

*(*)  =  M/M,  (7-1) 

according  to  some  given  rule. 

In  order  to  satisfy  the  principle  of  superposition,  only  linear 
operators  with  the  two  following  fundamental  properties  are  used  in 
quantum  mechanics: 

M  =  A 

(      ' 


where  C  is  an  arbitrary  constant. 

The  linear  operators  most  commonly  encountered  are  the  differ- 

entiation sign  (for  example,  the  momentum  operator  M  =  y  -5-  and 

the  LaplacianM=v*)  and  the  integration  sign. 
In  Poissonfs  equation 

V7  (r)  =  p(r)  (7.3) 
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the  operator  is  the  Laplacian  which  converts  the  function  /  (r)  into 
another  function  p(/-).  Conversely,  we  can  solve  Poisson's  equation 
and  find 


)  =  -  J 


(7.3a) 


Here  the  operator  v~*nas  the  form  of  a  definite  integral,  the  kernel 
of  the  operator  being 


>  r  )  ~4*  |  r  —  rf  f  4*  /(*_*')•  +  (y-yj  +  («-*')•  ' 


The  operators  y*  anc*  V~*»  which  have  the  same  effect  as  multiplica- 
tion by  unity  when  they  are  applied  in  succession,  are  called  inverse 
operators. 

The  Hamiltonian  operator 


consists    of  the  sum  of  the  kinetic  energy  operator  T==  —  -~  —  v% 

which  is  directly  porportional  to  the  Laplacian,  and  the  potential 
energy  V9  which  is  simply  a  function  of  the  coordinates. 

We   can   regard   the   action  of  the  potential  energyj/  (x^  on  the 

^^Uonji^he  action  ^  of'AJJJ!^ 

are  satisfieHT  THerSelore,  besides  differential  and  integral 
operators,  the  linear  operators  that  may  be  used  in  quantum  me- 
chanics include  any  function  of  coordinates  whose  action  on  the 
wave  function  is  simply  to  multiply  it.  For  example,  the  coordinate 
r  is  just  as  entitled  to  be  considered  an  operator  (the  position 
operator)1  as  the  momentum  operator  p  =  —  ih\^  which  is  a  differ- 
ential operator.  It  is  worth  noting  in  this  connection  that  in  quantum 
mechanics  r  does  not  represent  the  position  of  a  particle,  but  is  an 
argument  of  the  wave  function  and  determines  its  value  in  the 
coordinate  space.  The  quantity  which  is  equivalent  to  the  position 
can  be  found  from  the  operator  r  and  the  function  ^(r)  in  the  same 
way  as  the  momentum  of  a  particle  is  found  from  the  momentum 
operator,  that  is,  by  averaging  (see  the  following). 


This  will  become  particularly  clear  when  we  write  the  Schrodinger  equation  in  momen- 
tum space  (see  below),  where  the  wave  function  depends  on  p  In  this  case,  the  momentum 
operator  p  will  correspond  to  multiplication  by  an  ordinary  function,  and  the  position  oper- 
ator x  to  differentiation  with  respect  to  p. 
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B.    ELEMENTS  OF  REPRESENTATION  THEORY 

The  position  operators  x,  y  and  z  obviously  commute  with  each 
other,  since  operating  with  them  is  equivalent  to  multiplying  by 
ordinary  numbers.  Accordingly,  xy  =  yx,  xz  =  zx  9  and  soon.  The 

operators    p  v  =  —  ih  ()  v  ,  pv  =  —  -  ih  ^-  and  p^,  =  —  ih  ^-    also  commute 

with  each  other,  since  the  result  of  differentiation  is  independent 
of  the  order  in  which  it  is  performed 

d*  d2  d*  d*  •, 

-=—  ,    =  ~-       —-  =r=      -  ,  and  so  on. 
oxdy        dydx  '       dydz        0:dy 

Similarly,  it  can  readily  be  seen  that  the  pairs  of  operators  pv  and 
//,  Py  and  x9  and  so  on,  also  commute  with  each  other: 


that  is, 

P.V!/=</PJC- 

An  example  of  a  pair  of  noncommuting  operators  is  provided 
by  x  and  p^.  Indeed, 

*p^  =  -/torjj.-,  (7.4) 

whereas 

P^t  =  -  in  ^-  =-in(\  +  x^  $.  (7.5) 

Consequently, 

(Px^-^pJ  *  =  -«**,  (7-6) 

that  is, 

pxx  —  xpx  =  —  ifi.  (7.7) 

In  a  similar  fashion,  it  is  readily  shown  that 


The  noncommutativity  of  these  operators,  as  expressed  by  Eqs. 
(7.7)  and  (7.8),  is  a  significant  feature  of  quantum  mechanics. 

The    specific    choice   of  the   form   of  position  and  momentum 

*— -— — """TV 

operators  x  and(p^  =  —  ih\ ~\  satisfying  Eqs.  (7.7)  and  (7.8)  corre- 
sponds to  the  so-UkllUd  Coordinate  representation,  in  which  the 
wave  function  deoends  onIVTSffiH??3T59ri9rflTO 
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The  Schrodinger  equation  can  also  be  written  in  the  momentum 
representation.  For  the  sake  of  simplicity,  let  us  take  the  one- 
dimensional  case.  Using  the  8  functions  [see  (4.70)],  we  expand 
<[>  (A-)  into  a  Fourier  integral 


<|*  (x)  =  f 


5  (x  —  x')  dx'  =  ~ 


Since  k  =  ~9  wherep  =  /?A,  we  can  rewrite  this  equation  in  the  form 

T  V  (7.9) 


~9 


=^  J  *t(xyrl$*'d*. 


The  Fourier  transform  of  the  function  ty  (x)  ,  namely,  the  function 
<p(/>)  which  depends  on  momentum,  is  called  the  wave  function  in  the 
momentum  representation.  Equations  (7.9)  and  (7.10)  relate  the 
wave  functions  in  the  coordinate  and  the  momentum  representa- 
tions. 

Let  us  find  what  operator  in  the  momentum  representation 
corresponds  to  the  position  coordinate  x.  We  note  that  in  this 
representation  we  need  not  write  p  as  an  operator  (that  is,  in 
roman  type).  If  we  substitute  fy(x)  for  xfy(x)  in  Eq.  (7.10),  then 


XT  (p)  =  A_  J  x<!f  (x)  e  -'    *  dx  =  t*&$-.  (7.11) 

and  thus  in  the  momentum  representation  the  operator  x  has  the 
form  ~    -    -       „__-»<*• 


It    is    easily    verified   that   Eq.    (7.7)  continues  to  hold  true  in  the 
momentum  representation,  since 


In  exactly  the  same  way,  the  fundamental  relation  (7.7)  will  be 
satisfied  if  x  and  p  are  replaced  by  certain  appropriate  matrices. 
This  form  of  representation  is  called  matrix  representation;  it 
was  introduced  by  Heisenberg  somewhat  before  the  discovery  of 
Schrb'dinger's  equation. 

The  remainder  of  our  discussion  of  operators  will  be  conducted 
in  the  coordinate  representation. 
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C.  AVERAGE  VALUES  OF  OPERATORS 

In  classical  mechanics  the  motion  of  an  individual  point  is 
exactly  specified  by  a  function  relating  its  position  to  time.  This 
dependence  can  be  uniquely  determined  from  the  fundamental 
differential  equation  of  motion 

mjr=—  V  V(r). 

Once  we  have  determined  ras  a  function  of  time,  we  can  also  deter- 
mine the  momentum  and  energy  of  the  particle. 

The  situation  is  somewhat  different  if  there  are  many  particles 
involved,  as,  for  example,  in  the  kinetic  theory  of  gases.  In  this 
case  statistical  laws  characteristic  of  a  large  collection  of  parti- 
cles must  be  used.  It  turns  out  that  the  particles  of  such  a  collection 
obey  certain  distribution  laws,  which,  generally  speaking,  apply  to 
both  coordinate  space  and  momentum  space  (that  is,  a  distribution 
describing  both  the  velocities  and  the  energies).  The  function  /  that 
characterizes  this  distribution  is  called  the  distribution  function. 
Thus,  when  we  deal  with  a  large  collection  of  particles,  we  can 
only  consider  a  probability  that  a  particle  possesses  particular 
coordinate  and  momentum  values.  From  the  distribution  func- 
tion we  can  finA  that  average  values  of  the  position  and  momen- 
tum. 

x  =  J  xffxfp.    px=  J  pjd*xdAp, 
and  the  mean  square  of  these  quantities 

Afj  =  V  x*fd*xd*p ,  and  so  on, 

which  in  accordance  with  the  law  of  large  numbers  should  agree 
with  the  corresponding  experimental  values. 

We  must  mention  one  characteristic  feature  of  these  statistical 
laws.  In  classical  physics  they  are  a  result  of  averaging  over  the 
so-called  "hidden"  parameters,  which  determine  the  motion  of  each 
particle  in  accordance  with  Newton's  equations.  These  hidden 
parameters  do  not  appear  in  the  final  results.  In  principle,  however, 
classical  theory  enables  us  to  explain  why,  at  any  instant  of  time, 
the  coordinates  and  momenta  of  individual  particles  differ  from  the 
average  values,  even  though  this  explanation  may  be  very  com- 
plicated mathematically. 

In  quantum  mechanics  the  behavior  of  particles  is  described  by 
the  wave  function  ty(r,  /),  which  is  a  probability  function  even  when 
the  system  it  describes  consists  only  of  a  single  particle.  Thus, 
quantum  mechanics  allows  us  to  determine  only  the  average  values 
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of  dynamic  variables  regardless  of  whether  there  is  a  large 
number  of  microparticles  or  only  one.  It  must  be  emphasized  that 
in  quantum  theory  it  is  in  principle  impossible  to  explain  the 
deviations  of  observed  variables  from  the  average  values. 2  The 
method  of  calculating  averages  in  quantum  mechanics  is  similar 
to  that  used  in  statistical  mechanics.  The  basic  formula  used  for 
this  purpose  is 

,  (7.12) 

where  M  is  an  arbitrary  operator  (as  a  special  case,  it  may  be  a 
number),  and  the  quantity  ty*  (O'HO  plays  the  role  of  the  distribution 
function  /  provided  that  the  wave  functions  ty(t)  are  normalized: 


The  average  values  of  the  position  and  momentum,  as  has  already 
been  mentioned,  are  in  fact  computed  in  basically  the  same  way: 


.  (7.13) 

Here  Ic  is  the  coordinate  of  the  center  of  mass  of  the  wave  packet 
associated  with  the  function  <|>  (/),  and  pKis  the  momentum  of  this 
center  of  mass. 

Since  the  outcome  of  a  physical  measurement  is  a  real  quantity, 
the  average  values  must  be  represented  by  real  numbers.  There- 
fore, the  following  equation  must  hold: 

(A/)*  =  Al.  (7.14) 

When  this  requirement  is  satisfied,  the  corresponding  operators 
are  said  to  be  self-conjugate  (or  Hermitian). 

In  particular,  we  shall  show  that  the  operator  pA  satisfies  the 
condition  (7.14),  even  though  it  appears  to  be  purely  imaginary.  As 
a  preliminary,  we  must  first  prove  an  important  theorem  for 
"transferring"  a  derivative.  This  theorem  is  as  follows.  Suppose 
we  have  an  integral 

<x» 

"  uv(n'dxt  (7.15) 


Von  Neumann  has  demonstrated  that  hidden  parameters  cannot  be  the  basis  for  the 
statistical  laws  of  quantum  mechanics  Von  Neumann's  proof,  howeVer,  is  valid  only  with- 
in the  limits  of  the  actual  framework  of  quantum  mechanics  itself,  and  if  quantum  me- 
chanics is  not  taken  as  an  ultimate  theory,  Von  Neumann's  theorem  cannot  be  regarded  as 
generally  valid. 
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where  v<n]  =  ~^- .  Then,  if  all  terms  of  the  type 

oo  oo 

'V-2'],,..,       r«<«-"w|  (7.16) 


vanish,  the  result  of  integration  of  G  is  not  altered  if  we  transfer 
the  /ith  derivative  of  the  function  v  to  the  function  u  in  (7.15)  and 
place  the  factor  (—1)"  in  front  of  the  integral: 


ultt]vdx.  (7.17) 


Indeed,  if  we  carry  out  an  /7-fold  integration  by  parts  in  (7.15)  and 
assume  that  all  terms  in  (7.  16),  vanish,  we  obtain  the  relationship 
(7.17).  In  the  case  of  a  discrete  spectrum,  the  conditions  (7.16)  are 
always  satisfied  because  the  wave  function  decreases  exponentially 
at  infinity.  In  the  case  of  free  motion  (that  is,  a  continuous  spec- 
trum), these  expressions  vanish  as  a  consequence  of  the  periodicity 
condition.  Physically,  the  fulfillment  of  condition  (7.16)  means  that 
no  particles  or  currents  exist  at  infinity. 

Returning  to  the  proof  of  the  self-conjugateness  of  the  operator 
pv,  we  substitute 

u  ==  <p  (/),    v  =  —  ihty  (t)  and  n  =  1 
into  Eq.  (7.17).  From  this  it  immediately  follows  that 


and  thus  the  self-conjugateness  condition  (7.14)  is  satisfied  for  p^,. 
We  note  that  unlike  the  operator  pAr  =  —  ih  ^v,the  real  operator-^ 

is  not  self-conjugate  and  its  average  value  has  no  physical  meaning. 
If  an  operator  M  has  only  one  eigenvalue  >.  (and  one  eigenfunc- 
tion  <^),  it  is  readily  seen  that  this  eigenvalue  is  identical  with  the 
average  value  of  the  operator.  Indeed,  using  the  general  rule  (7.12) 
for  determining  the  average  value  of  an  operator  and  substituting 
the  equation 


we  obtain  for  M 

==*  f  <t*(t)tf(t)<Px  =  \.  (7.19) 


126  NONRELATIVISTIC    QUANTUM    MECHANICS 

On  the  other  hand,  suppose  that  the  operator  M  [Eq.  (7.18)]  has 
several  eigenvalues  Xt>  X.2l  .  .  .  ,  Xn,  .  .  .  ,  corresponding  to  the  functions 
<MO.  MO»"-»  M0»---  (*or  example,  this  may  be  the  case  for 
the  energy  operator  E  =  ih~d-,  for  which  Xrt  —  £„).  Then  since  the 
general  solution  <[>(/)  can  be  written  in  the  form 

(0.  <7'20> 


we  find  that  the  average  value  of  the  energy  operator  E  =  Ih  -     is 


(7.21) 


Here  each  |Cn|2  is  the  probability  that  the  particle  is  in  the  corre- 
sponding quantum  state.  If  all  Cn  except  one,Cno,  are  equal  to  zero, 
then  E  =  Enn,  and  thus  the  average  energy  corresponds  to  the  eigen- 
value £n0.  Consequently,  the  eigenvalue  £no  corresponds  to  the 
experimentally  observed  energy.  In  the  case  where  several  co- 
efficients Cn  differ  from  zero:  Crt|f  Cn,, .  . . ,  Cni we  can  obtain 

any  of  these  values  of  energy  in  experimental  measurements.  If 
the  experiment  is  repeated  many  times,  the  number  of  measure- 
ments which  yield  an  energy  Eni  should  be  proportional  to  the 
corresponding  theoretical  probability  |  Cn  |*. 


Chapter  8 

Average  Values  of  Operators.     Change  of 
Dynamic  Variables  with  Time 

A.    DERIVATION  OF  THE  UNCERTAINTY  PRINCIPLE 

As  indicated  in  the  preceding  chapter,  the  observable  dynamic 
quantities  associated  with  the  operators  must  be  regarded  as 
average  values,  given  by  Eq.  (7.12),  This  is  true  regardless  of 
whether  these  operators  commute  with  the  Hamiltonian,  that  is, 
regardless  of  whether  the  corresponding  physical  quantities  are 
constants  of  the  motion. 

We  shall  now  show  that  if  t35^gjmamic  quantities  correspond 
to  noncommuting  operators,  they  do  not  have  simultaneous  definite 
values  in  quantum  mechanics.  Of  greatest  importance  in  this 
respect  is  the  calculation  of  the  deviations  from  the  average  values 
of  two  canonically  conjugate  quantities— the  position*  and  momentum 
px.  Our  discussion  will  be  carried  out  in  the  coordinate  represen- 
tation and  we  shall  restrict  ourselves  to  the  case  in  which  the  wave 
function  is  independent  of  time  (the  stationary  case).  Then  the 
average  values  of  the  position  and  momentum  can  be  found  from 
the  relations 

*'  (8.1) 

gd'x  (8.2) 

First  of  all,  we  note  that  even  though  the  average  error  or 
average  deviation  from  the  mean,  which  is  given  by 

==X--Z==Qt  (8.3) 

is  equal  to  zero,  it  in  no  way  follows  that  the  particle  cannot  occupy 
positions  other  than  x.  The  reason  for  this  is  that  the  deviations 
have  different  signs  relative  to  the  mean  x,  and  consequently  they 
cancel  out  on  the  average.  Accordingly,  the  deviation  from  the 
average  value  of  the  operator  should  be  characterized  by  the 
variance  (mean-square  deviation),  which  is  positive  for  all  deviations 
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from  x.  The  variance  of  the  position  can  be  calculated  from  the 
formula 

(Axp  =  j  <[>*  (x  —  x)^d^x  =  ^~2  (x)*  +  (xY  =  tf  —  (x)\        (8.4) 

We  note,  incidentally,  that  if  the  variance  is  (A*y2  =  0,  it  follows 
that  the  probability  of  the  electron  occupying  a  position  in  space 
differs  from  zero  only  at  x  =  x.  In  this  case  the  average  value  of 
the  position  is  equal  to  the  exact  value;  that  is,  the  corresponding 
probability  of  the  particle's  position  can  be  described  by  a  function 
similar  to  the  £  function. 

Similarly,  the  variance  of  the  momentum  is  given  by 


(8.5) 


In  order  to  establish  the  relationship  between  (Kxf  and  (Apj*f 
let  us  take  a  coordinate  system  whose  origin  lies  at  the  center  of 
mass  of  the  wave  packet  (,v  =  0),  and  which  moves  with  the  same 
velocity  as  the  center  of  mass  (p*  =  0).  (The  use  of  this  coordinate 
system  does  not  involve  any  restriction  on  the  generality  of  the 
discussion.) 

In  this  case 


(8.6) 
'* 


Let  us  consider  the  integral 

A-,  (8.7) 


=  J  ( 


where  a  is  some  arbitrary  real  quantity  independent  of  x.  Equation 
(8.7)  can  be  written  in  the  form 

I(*)  =  A**  —  B*  +  Ct  (8.7a) 

where 


A=  f  <]>*x*tyd*x= 


<8-8) 
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Since  the  integrand  in  (8.7)  is  essentially  a  positive  quantity 

/W^O,  (8.9) 

condition  (8.9)  imposes  a  certain  definite  restriction  on  the  co- 
efficients A9  B  and  C.  Indeed,  if  this  relation  is  satisfied  for  a  =  a0 
corresponding  to  the  minimum  of  the  function  /  (a),  it  will  also  be 
satisfied  for  any  arbitrary  a.  The  value  of  a0  can  be  found  from  the 
condition 

/'(«,)  =  2A**  -fl  =  0,  that  is,  *0=^,    ' 
and 


Hence  the  minimum  value  of  /  (a)  will  be  equal  to 

B* 


From  this  it  follows  that  inequality  (8.9)  will  hold  for  all  real  values 
of  at  provided  the  following  condition  is  satisfied: 

B**^4AC.  (8.10) 

Substituting  the  values  for  A,  B  and_Cfrom_(8.8)  and  using  (8.6), 
we  obtain  the  relationship  between  (ApJ8  and  (Ax)* 

fA¥)MW^°.  ^  (8-11) 

This  inequality  represents  a  rigorous  formulation  of  the  uncertainty 
principle. 

By  using   the   relation  pxx  —  xpx  =  —  ih  [see  Eq.   (7.7)],  we  can 
rewrite  (8.11)  in  the  form 


xpx  \\  (8.12) 

Generalizing  (8.12),  we  can  say  that  whenever  two  operators  Mt  and 
M2  do  not  commute  with  each  other,  they  satisfy  the  uncertainty 
relation 


•  (AAf  ,)'  ^  j  |  MjM,  —  MJVIj  |9,  (8.13) 

where 


=  J  r  (M,  -  AT,)'  f  d^  (/  =1,2).  (8.14) 
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As  we  have  already  mentioned,  the  uncertainty  principle  is  a 
consequence  of  the  wave-particle  duality  that  underlies  quantum 
mechanics  and  is  in  no  way  connected  with  the  experimental  limita- 
tions. Experiments  may  only  prove  the  results  which  follow 
from  the  uncertainty  principle.  The  basic  meaning  of  the  uncertainty 
principle  consists  in  the  following  fact:  the  probability  distributions 
of  variables  whose  operators  do  not  commute  cannot  simultaneously 
take  the  form  of  a  8  function  (see  Fig.  8.1).  Moreover,  if  the  prob- 
ability distribution  of  one  variable  approaches  a  S  function,  the 
probability  distribution  of  the  other  variable  will  spread  out.  In 
the  limit,  when,  for  instance,  the  probability  function  for  x  (that 
is,  |<K*)|*)  takes  the  form  of  a  I  function  [(A*)*  =  0],  the  probability 

function   for   the   momentum    px  (that  is^ [<p(A*)|')  becomes  such 

that  it  is  constant  for  all  values  of   px    (A/?jr)2  =  oo. 


Fig.    8.1.   The   probability   distribution   function   in   (a)   coordinate 

space  and  (b)  momentum  space:   [(Arc)2  (Ap)2]l/J  =  H/2. 
If  the  distribution  in  coordinate   space  (a)  contracts,  the  distribu- 
tion in  momentum  space  (b)  spreads. 

The  necessary  condition  for  simultaneous  measurement  of  two 
dynamic  quantities  is  the  condition  of  commutativity  of  their 
corresponding  operators. 


B.    POISSON  BRACKETS  IN  CLASSICAL  AND  QUANTUM 

THEORY 

The  state  of  a  system  in  classical  mechanics  is  defined  by  its 
dynamic  variables.  The  quantities  appearing  in  the  canonical,  or 
Hamilton's,  equations  of  motion  in  classical  mechanics  depend  on 
the  coordinates  x(,  momenta  pt  and  time  t,  that  is, 


/=/(p|f  *„  t). 


(8.15) 
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For   example,   in   a   one-dimensional   time-independent  problem, 
the  Hamiltonian  depends  only  on  x  and  px: 


With   the   help   of  the  canonical  (Hamilton's)  equations  of  motion 
we  obtain 

*-".-£  Md  '-—£—£.  <8-17> 

or 


If  there  are  n  degrees  of  freedom  (/=!,  2,...,  /*),  Eqs.  (8.17) 
take  the  form 

*       dH       .  _      dH  /ft  17av 

*'  =  ^'    H  —  dxS  (8'17a) 

Hence  the  time  rate  of  change  of  the  quantity  f  [see  (8.15)]  is  given 
by  the  equation 


Using  the  canonical  equations  (8.17a),  we  obtain 


di=di~\  IH.  f],  (8.18) 


where  the  expression 


is  called  the  classical  Poisson  bracket. 

If  /  does  not  depend  explicitly  on/,  then  ?f=  0  ,  and  consequently 
the  variation  of  /  is  completely  determined  by  the  Poisson  bracket: 

g=l//,  /I.  (8.20) 

If  the  Poisson  bracket  vanishes  ([//,  /]  =  <)),  the  quantity  /  does  not 
depend  on  time,  or  it  is  conserved. 

/  =  const. 
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For  example,  if  the  energy  does  not  depend  explicitly  on  time,  then 
<5///a/  =  0.  Since  obviously  [//,  //]  =  0f  it  follows  that  Hamilton's 
function  (the  energy  in  this  case)  is  a  constant  (//  =  const).  Further- 
more, substituting  the  coordinate  xl9  and  then  the  momentum  pt  for 
/  into  (8.20),  we  obtain  the  relations  (8.17a),  that  is,  Hamilton's 
equations  of  motion. 

We  shall  now  generalize, the  classical  Poisson  brackets,  which 
can  be  used  to  fincnne  time  variati_gn_gijin^^  to 

*he  quantumj caseJ  """""  '"  ~" 

i'irst  oF"all,  we  recall  that  in  quantum  mechanics  physical 
meaning  can  be  attached  only  to  the  average  values  of  operators 
(position,  momentum,  and  so  on).  It  is  the  time  rate  of  change  of 
these  average  values  that  we  must  determine.  The  average  value  of 
any  operator  f  is  given  in  quantum  mechanics  by  Eq.  (7.12),  in  which 
the  time  /  occurs  as  a  parameter.  From  this  equation,  we  can  find 
the  total  derivative  of  f  with  respect  to  time: 


=  f  /(')  It  (/)  **  +  \  d\"}  ft  (/)  <px+ 

J  at  J       ot 


Substituting  for  ~-        and          the  expressions  ^  H^*)  and(-  ~ 
respectively,  we  can  reduce  (8.21)  to  the  form 


+  {  J  [(H  t*  (0)  (f  t  (0)  -  **  (0  f  (H  *  (/))]  d*x,  (8.22) 

where 


Using  the  theorem  for  transferring  a  derivative  [Eq.  (7.17)]  and 
keeping  in  mind  that  the  potential  energy  is  an  ordinary  function  of 
coordinates,  we  readily  obtain 


J  r 


Consequently,  the  change  of  7  with  time  will  be  given  by  the  equation 


(8.23) 
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The  expression 

JH,  f}=4(Hf~fHJ|  (8.24) 

is  the  generalization  of  the  Poisson  bracket  (8.19)  to  the  quantum- 
mecanical  case  and  is  called  the  quantum  Poisson  bracket. 

Obviously,  in  the  case  where  57  =  0  (as  a  rule  an  operator  f  does 
not  contain  the  time  explicitly),  Eq.  (8.23)  becomes 


It  follows  that  in  this  case  the  time  change  of/  is  completely  deter- 
mined by  the  quantum  Poisson  bracket.  Furthermore,  if  the  opera- 
tor f  commutes  with  the  Hamiltonian  operator  H,  the  physical 
quantity  /  corresponding  to  this  operator  is  conserved,  as  can  be 
seen  from  (8.25). 

With  the  help  of  (8.25)  it  is  easy  to  prove  that  the  energy  of  a 
particle  moving  in  a  time-  independent  potential  field  V  (r)  is  con- 

served. The  expression  {H,  H}=-^-(HH  —  HH)  vanishes  in  this 
case  and  therefore  from  (8.25)  we  have 

W  =  const.  (8.26) 

On  the  other  hand,  l\tyn  =  Entyn  according  to  the  time-  independent 
Schrbdinger  equation,  and  therefore,  when  <1>  =  '])  (0»  we  have  [see 
Eq.  (7.20)] 


H  = 

that  is,  Eq.  (8.26)  is  nothing  but  the  law  of  conservation  of  energy 
(E  =  const)  for  a  particle  moving  in  a  time-independent  field  of 
force. 


C.    EHRENFEST'S  THEOREM 

We  shall  now  find  the  quantum  analog  of  the  classical  equations 
of  motion  (8.17).  For  this  purpose  we  shall  use  the  quantum  Poisson 
brackets.  Noting  that  x  and  p*  do  not  contain  the  time  explicitly, 
let  us  use  Eq.  (8. 25)  to  determined  and  ~pX9  substituting  into  it  either 
f  =  x  or  f  =  pje,  as  the  case  may  be.  In  the  case  of  f =x,  we  find 


jfc={H,  x}=  ln  (H*  — 
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where 


.  (8.28) 

Since  x  and  V  (x)  commute,  Eq.   (8.27)  can  be  reduced  to  the  form 


Adding  the   quantity    (vxxpx  —  pxxpx)  to  the  right-hand  side  of  this 
equation,  we  have 


:  —  xpx)px).  (8.30) 

Then,  using  Eq.  (7.7),  we  obtain 

i=m*'  (8-31) 

In  order  to  determine  the  time  rate  of  change  of  the  momentum 
we  must  substitute  the  momentum  operator   p*  for  the  operator  f 

in  (8.25).  Then,  since     pxp"x  —  p!p*  =  0,  we  find  for  px 

7)  =  -?.  (8.32) 


Hence,  using  (8.31),  we  obtain 

F(J).  (8.33) 


Equations  (8.31)-(8.33)  constitute  Ehrenfest's  theorem,  according 
to  which  the  fundamental  equations  of  classical  mechanics  can  be 
generalized  to  quantum  mechanics  by  replacing  the  classical 
variables  by  the  average  values  of  the  corresponding  operators. 


D.    TRANSITION  FROM  QUANTUM  TO  CLASSICAL 
EQUATIONS  OF  MOTION 

Let  us  compare  the  classical  equation  of  motion 

(8.34) 


with  the  corresponding  quantum-mechanical  form  (8.33).  As  was 
previously  stated,  x  is  the  quantity  which  corresponds  to  the 
classical  position  coordinate  in  quantum  theory.  Accordingly,  we 
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could  assume  the  quantum-mechanical  equation  to  be  identical  with 
the  classical  equation  if  we  had 

(8.35) 


instead  of  (8.33).  This  would  be  equivalent  to  replacing  *  by  its 
average  value  x  in  the  classical  equation  relating  the  force  and  the 
position.  Ehrenfest's  theorem  asserts,  however,  that  the  equation 
of  motion  for  the  quantum  case  contains  the  average  value  of  the 
actual  force,  that  is,  F  (x).  Therefore,  in  order  to  make  a  transition 
from  quantum  equations  of  motion  to  classical  equations,  we  must 
first  establish  the  relationship  between  F  (x)  andF(Jf). 
Let  us  represent  the  force  operator  F  (x)  in  the  form 

F(x)  =  F(x  +  bx),  (8.36) 

where    bx  =  x—  x,   and   expand  F(x)    in  a  Taylor  series  about  the 
point  x  =  x.    Then  we  obtain 

F(x)  =  F(X)+(*x)F'(x)  +  ^-F'(x)+....  (8.37) 

Taking  the  average  of  this  expression  in  accordance  with  Eq.  (7.12) 
and  considering  that  "(A*)  =  (jc  —  x  )  =  0,  we  obtain 


£-(*)+  ...  .  (8.38) 

The  quantum-mechanical  equation  of  motion  (8.33)  therefore,  takes 
the  form 

mQx  =  F(x)  +  (^F»(X).  (8.39) 

Here  the  expression  --  •*--  F"  (x)  is  the  quantum-mechanical  correction 

to  Newton's  classical  equation.  Clearly,  the  criterion  which  must 
be  satisfied  in  transition  from  quantum  equations  of  motion  to 
classical  equations  is  the  inequality 


F(x) 


(8.40) 


It  should  be  noted,  however,  that  mere  satisfaction  of  this  inequal- 
ity is  still  not  sufficient  to  allow  us  to  apply  all  classical  concepts 
to  the  description  of  the  motion  of  a  particle.  Indeed,  in  quantum 
mechanics  the  average  value  of  the  kinetic  energy  f  is  defined  as 
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whereas  the  classical  analog  of  the  quantum-mechanical  kinetic 
energy  should  actually  be  taken  as 

T(Px)  =  J^-t  (8.42) 

Let  us  now  express  the  quantum-mechanical  definition  of  the  kinetic 
energy  7'(p.v)  in  terms  of  its  classical  analog  T(px).  For  this  pur- 
pose, we  shall  use  the  equation 

where  bpx  =  px  —  px.  Removing  the  parentheses  in  (8.43)  and  con- 
sidering that  after  averaging 


we  have 


T)  -  T  (AV)  +  --  (A/a2.  <8-44) 


From  this  we  obtain  the  condition  under  which  we  can  make  a 
transition  from  the  quantum-mechanical  expression  for  the  kinetic 
energy  (8.41)  to  the  classical  expression 


(^)X  pi  =  2m0T  (/>,).  (8.45) 

Multiplying  (8.45)  by  (8,40),  we  obtain  the  general  condition  for 
the  validity  of  the  classical  approximation  in  the  microscopic  world: 


(A*)5  •  (Ap,)2  <  4mor  (px)    -^L  .  (8.46) 

If  we  take  into  account  the  uncertainty  relation 

rrT2   TA~T2^  — 
condition  (8.46)  becomes 

r-i  /—  \  *  « 

(8.47) 


16  ' 

Let  us  apply  this  condition  to  the  hydrogen  atom,  when 
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Substituting  these  values  into  (8.47),  we  obtain  the  inequality 

'>-ri£r-  <8-48> 

Since  — ^  =  a0,  where  a0    is  the  radius  of  the  first  Bohr  orbit,  and 

fllQPQ 

r  =  /z2a0,  we  obtain  instead  of  (8.48) 

,  (8.49) 

and  therefore,  in  the  limit  of  large  quantum  numbers,  the  results 
of  quantum  theory  approach  the  classical  results. 


roblem  B&*jjJ  Determine  the  wave  function  of  a  freely  moving  electron  in  the 
p  represe'nt'a'tlonr  Write  the  normalization  condition  in  the  p  representation. 

Find  the  average  values  of  the  operators  for  the  momentum  and  energy  of  a  particle. 
Solve  the  problem  in  the  one-dimensional  case,  and  then  generalize  it  to  the  three- 
dimensional  case. 

Solution.  Let  us  choose  the  x  axis  along  the  direction  of  motion  of  the  electron.  The 
wave  function  of  a  free  electron  in  the  x  representation,  normalized  in  terms  of  fc  (p'0  —po\ 
will  have  the  form  [see  (4.81)] 


To  transform  to  the  p  representation,  we  use  Eq.  (7.  10)  in  the  form 

9  (Po,  P)  =  -~^  J  *(/>„  x')  <f  '  T  *'  dx',  (8.50) 

obtaining 

T  (Po,  P)  =  *(P  —  Po)' 
The  normalization  condition  has  the  same  form  in  both  x  and  p  representations: 


=  V  fp*(pi, 

•> 


The  average  value  of  the  operators  should  be  calculated  from  the  equations 

Po  +  AP  Po  +  AP 

M  =     J     dp'fy*  (p'09  x)  Mi]/  (p0,  x)  dx  =     j     dpi  J  cp*  (p'w  p)  M  <p  (pQ,  p)  dp. 

Po  —  Ap  p0  _  Ap 

which  gives  us 


The  problems  in  this  chapter  refer  to  Chapters  6  and  7  as  well. 
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In  the  three-dimensional  case,  we  have 

-_1—  .—^  <f>  (Po,  P)  =  *  (P  —  Po). 

^Problem  8.2.jDeterrmne  the  probability  of  the  various  values  of  the  momentum  of  a 
parfrllu  in  Un   ground   state,  the  particle  being  in  a  one-dimensional  square  well  with 
infinitely  high  walls.  Verify  the  normalization  in  the  p  presentation. 
Solution.  Taking  the  value  of  the  wave  function  from  Chapter  4 


and  using  Eq.  (8.50),  we  have 


~~    ~7T x 
xe  dx. 


Evaluating  the  integral   and   squaring  its   modulus,  we  obtain  the  required  probability 
distribution 


which  satisfies  the  normalization  condition 


—  oo 

In  evaluating  the  last  integral  we  may  use  the  relation 

+  00 


Jcos  ap  dp  __     sin  \a\b 
b*-p*  -n        'b        ' 


—  oo 


which  should  then  be  differentiated  with  respect  to  the  parameter  b. 
^Jroblemj^jj^lnvestigate  the  motion  of  a  charged  particle  in  a  constant  and  uniform 

Solution.  This  problem  is  solved  most  simply  in  the  momentum  representation.  Since, 
according  to  Eq.  (7.  1  1),  the  potential  energy  in  momentum  space  can  be  represented  in 
the  form 


the  corresponding  Schrbdinger  equation  in  momentum  space  becomes 


The  solution  of  this  equation  is 

cp  (E,  p)  =  —L=  e™  (EP  "  ^  • 

Y^np 

where,  because  of  the  continuity  of  the  spectrum,  the  normalization  coefficient  was  found 
from  the  condition  for  8-f  unction  normalization: 

=  »(£'-/:). 
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The  wave  function  in  the  position  space  can  be  determined  with  the  help  of  Eq.  (7.9) 


where 

oo 

[~  — «5)ef« 


is  the  Airy  function,  which  is  proportional  to  the  Bessel  function  of  order  1/3  (see 
Chapter  5),  and  £  =  f  x  +  —„  j  f-*|~)  * .  Examining  the  asymptotic  behavior  of  the  Airy 
function 


1    '    for  5  <  0, 


-sin 


)l/4 

it  is  readily  shown  that  the  region  of  large  negative  values  of  jc,  where  —  Fx>  E,  repre- 
sents a  potential  barrier,  whereas  the  region  where  E  >  —  Fx  is  quasi-classical, 
because 


-  S  =  -i-  f  /?</A"  =  ~  f  /2/H0  (£  +  ^)  dx  +  const  = 


1 

* 

2^ 
3 


electron  moves  in  a  constant  and  uniform  magnetic  field.  Find  the 
time  'SSrWatwe   of  the  average  value  of  the  position  and  momentum  of  the  electron  (in 
other  words,  generalize  the  Ehrenfest  theorem  for  the  case  of  motion  in  a  magnetic  field). 
Solution,  According  to  Eq.  (5. 9 a),  the  Hamiltonian  of  an  electron  in  a  magnetic  field  is 


Choosing  the  direction  of  the  uniform  magnetic  field  to  be  along  the  z  axis  (Hz  ^  O,//^ 
—  7/^  =  0),  the  field  can  be  specified  by  the  vector  potential 


In  order  to  determine  the  time  rate  of  change  of  the  electron's  position,  let  us  use  the 
quantum-mechanical  equation  of  motion 

v  =  l{Hr-rH}. 
Substituting  H,  we  readily  find 

v==J_(p  _.!„)>  (8.51) 

W0      \  C  j  MQ 

For  the  time  derivative  of  the  x  component  of  the  momentum  operator,  we  obtain 
^-i/KP  -P  H)-  —  fo    --A  \H  -~v  H 

dt  ~  n  (KP*    F*M]-  m.c  \py     c  Ay)  "*  -  c  vy"s' 
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Similarly,  it  can  be  shown  that 

dt  c     *   *'      dt 

Combining  these  equations,  we  have 

^  =  L  [y//].  (8.52) 

Equations  (8.51)  and  (8.52)  constitute  the  required  result. 

i  8.5J  As  we  know,  the  behavior  of  an  electron  in  a  metal  (x  <  0)  can  be 
described  wifn  a  sufficient  degree  of  accuracy  by  the  following  potential  energy  function 
(see  Chapter  6): 


Determine  the  coefficient  of  reflection  from  the  surface  of  the  metal  for  electrons  located 
inside  the  metal  (x  <  0)  in  the  following  cases:  (a)  E  <i  V0  and  (b)  E  >  VQ.  Show  that,  even 
though  in  case  (a)  the  electrons  do  penetrate  into  the  region  (*>0),  ultimately  they  return 
back  into  the  metal.  Construct  a  graph  of  the  change  of  the  potential  energy  and  of  the 
wave  function  of  the  moving  electrons. 
Answer. 

a)  ForE  <  V0l  /?==!,  even  though  ty  (jc>  0)  ^  0. 


Hint.  In  choosing  the  solution  for  x  >  0  (outside  the  metal),  only  the  exponentially 
decreasing  solution  should  be  retained  in  case  (a),  and  only  the  solution  corresponding  to 
a  wave  traveling  along  the  x  axis  in  case  (b). 


Chapter  9 

Elementary  Theory  of  Radiation 

A.    SPONTANEOUS  AND  INDUCED  TRANSITIONS 

According  to  classical  electrodynamics,  an  accelerated  charge 
is  a  source  of  electromagnetic  radiation.  The  amount  of  energy 
radiated  per  unit  time  is  given  by  the  well-known  equation1 


where  r  =  w  is  the  acceleration  of  the  particle. 

If  the  source  of  radiation  is  a  one-dimensional  harmonic  oscil- 
lator 

x  —  acosu>tt  (9.2) 

the  frequency  of  the  emitted  radiation  is  the  same  as  the  mechanical 
frequency  of  vibration  of  the  oscillator,  and  its  intensity  is  propor- 
tional to  a2. 

In  the  case  where  the  motion  of  a  charge  is  governed  by  a  more 

complicated  periodic  function  x  =  f(t)  with  a  period  T  =  —  ,  we  can 
expand  the  function  f(t)  in  a  Fourier  series: 

x  =  y  ak  cos  coW,  (9.2a) 

u 

and  treat  the  radiation  as  if  it  were  generated  by  a  set  of  oscillators 
with  frequencies  («/,  =  ku>,  where  k=\9  2,  3,  ...  .  Radiation  will  be 
emitted  both  at  the  fundamental  frequency  (k=  1)  and  at  harmonics 
feo  of  the  fundamental  frequency.  The  intensity  corresponding  to 
the  kth  harmonic  will  be  proportional  to  aj. 

Thus,  according  to  classical  theory,  the  radiation  of  a  system 
is  completely  determined  by  its  mechanical  properties.  Indeed,  the 


In  this  chapter  the  quantum-mechanical  averages  will  be  distinguished  from  time 
averages  by  writing  the  latter  with  the  subscript  "av  *  In  accordance  with  the  previous 
notation,  quantum-mechanical  averages  will  be  indicated  by  a  bar. 
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frequency  of  the  radiation  is  either  equal  to  or  is  a  multiple  of  the 
mechanical  frequency  of  oscillation  of  the  system,  and  the  intensity 
of  the  corresponding  harmonic  is  proportional  to  the  square  of  the 
amplitude. 

In  quantum  mechanics,  the  problem  of  radiation  must  be  ap- 
proached in  a  somewhat  different  manner.  According  to  quantum 
theory,  radiation  is  emitted  only  when  a  particle  (or  a  system) 
makes  a  transition  from  one  energy  state  to  a  lower  energy  state 
(so-called  "downward"  transition). 

The  first  quantum  treatment  of  the  problem  of  radiation  was 
proposed  in  1917  by  Einstein.  He  introduced  the  coefficients  A  and 
B  (now  called  the Eins tein  coefficients)  to  characterize  the  spontane- 
ous transitions  and  the  induced  transitions  (that  is,  transitions  due 
to  some  external  effects)  of  a  system  from  one  energy  level  to 
another;  Einstein  also  obtained  an  equation  relating  these  two 
coefficients. 

The  basic  elements  of  the  quantum  theory  of  radiation  are  the 
following.  Suppose  one  of  the  electrons  of  an  arbitrary  atomic 
system  is  in  the  excited  state  n  with  an  energy  £„.  Then  there  is 
a  definite  probability Anfl'  per  unit  time  of  a  spontaneous  transition 
of  this  electron  into  a  lower  energy  state  n'  with  an  energy  En>. 
The  transition  is  accompanied  by  the  emission  of  a  photon  with  an 
energy  ftu  =  En  —  En>.  If  the  number  of  excited  atoms  is  equal  toNn, 
the  energy  radiated  per  unit  time  during  spontaneous  transitions 
only  can  be  written  as 

iv/spon  KJ    A      4.  /q  ov 

"em      —"nAnn'nu.  \V.3) 

When  the  atoms  are  subjected  to  the  influence  of  external  electro- 
magnetic radiation,  the  latter  will  cause  both  upward  and  downward 

induced  transitions.  The  up  ward  transitions 

f          will,  of  course,  be  associated  with  the  ab- 
sorption of  photons. 

Adopting  the  notation  introduced  by  Ein- 
stein, we  designate  the  probabilities  of  an 
induced  transition  from  level  n  to  n'  by  Bnn> 
and  from  level  n'  to  n  by  £„'„•  Since  the 
number  of  induced  transitions  should  be 
proportional  to  the  spectral  energy  density 
Pa  of  the  external  radiation,  we  obtain  the 
following  equations  for  the  energy  radiated 
and  absorbed  per  unit  time  in  induced 
transitions: 


Ann' 


Fig.  9. 1.  Downward     tran- 
sitions (spontaneous    and 
induced)       and       upward 
transitions  (induced) 


vind 
em 


where  Nn>  is  the  number  of  atoms  in  state  n'. 


(9.4) 
(9.5) 
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Let   us    consider  the  case  in  which  the  number  of  upward  and 
downward  transitions  is  the  same  (see  Fig.  9.1): 

NnA  nn'  +  Nn^Bnn>  =  Nn^Bn'nt  (9.6) 

that  is,  when  a  state  of  the  rmodynamic  equilibrium  exist  between 
the  heated  atoms  and  the  light  radiated  by  them  (black-body  radi- 
ation), which  in  turn  interacts  with  the  atoms.  In  this  state,  the 
atoms  and  the  radiated  light  form  a  closed  system. 

Since,  in  this  case,  the  energy  distribution  of  the  electrons  is 
given  by  the  Maxwell  distribution 


we  obtain 

Ann'C-Kn'*T+  ^nn^  En"T  =  ?»Bn.ne  ~En'^t  (9.7) 

Dividing  by  the  factor  e-En/kl  and  noting  that  En  —  En>~ft(*>t  we  obtain 


Since  the  spectral  energy  distribution  of  black-body  radiation  is 
completely  independent  of  the  specific  structure  of  the  atoms  or 
molecules  involved,  Eq.  (9.8)  is  essentially  the  same  as  Planck's 
formula  [see  (1.42)] 

pw  =  _*l".L_^_.  (9.9) 

omparing  (9.8)  with  (9.9)  we  find 

Ban.  =  Bn>n  =  -^-3-  Ann'.  (9.  10) 

It  is  seen  from  Eq.  (9.10)  that  the  probability  coefficients  of  upward 
and  downward  induced  transitions  are  equal  to  each  other  and 
proportional  to  the  coefficient  of  spontaneous  trans ition/4nn'.  There- 
fore, to  describe  the  radiation  of  atoms  or  molecules,  it  is  sufficient 
to  determine  only  one  of  these  coefficients. 

B.     CALCULATION  OF  PROBABILITIES  OF  SPONTANEOUS 
AND  INDUCED  TRANSITIONS 

In  quantum  mechanics  induced  transitions  are  explained  in  terms 
of  an  interaction  between  the  electrons  of  an  atom  and  external 
electromagnetic  radiation.  The  problem  of  determining  the  causes 
of  spontaneous  transitions  was  left  unexplained  by  the  Schrodinger 
theory. 
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The  answer  was  obtained  only  after  the  development  of  a  theory 
of  radiation  in  which  quantization  of  the  electromagnetic  field 
(second  quantization)  was  used.  The  general  features  of  the  theory 
are  outlined  below. 

Electrons  interact  not  only  with  real  photons,  but  also  with 
virtual  photons  (photons  which  are  in  an  unobservable  state)  or,  as 
they  are  called,  vacuum  fluctuations  of  the  electromagnetic  field 
(for  further  details  on  vacuum  fluctuations,  see  Chapter  22).  This 
interaction  causes  spontaneous  transitions.  The  classical  analog 
of  the  interaction  between  the  electrons  and  the  field  of  virtual 
photons  is  the  effect  of  Planck's  radiation  damping  on  a  moving 
electron 


which  represents  the  self-interaction  of  the  electron  with  its  own 
electromagnetic  field.  Under  certain  conditions  this  electromagnetic 
field  may  detach  itself  from  the  electron  in  the  form  of  electro- 
magnetic radiation.  In  the  language  of  quantum  electrodynamics 
this  amounts  to  a  transition  of  photons  from  a  virtual  state  into  a 
real  state. 

The  exact  expression  for  the  coefficients  A  and  B  can  be  found 
on  the  basis  of  quantum  electrodynamics  and,  therefore,  problems 
of  radiation  can  be  completely  solved.2 

In  the  present  discussion  we  shall  obtain  coefficient  A  by  means 
of  an  appropriate  generalization  of  the  results  of  classical  radiation 
theory  to  the  quantum  case.  It  should  be  emphasized  that  this 
generalization  leads  to  the  same  results  as  the  rigorous  method  of 
second  quantization. 

In  our  derivation  we  shall  use  the  correspondence  principle  to 
generalize  the  classical  expression  for  the  radiated  energy  [Eq. 
(9.1)]  to  the  quantum  case.  First,  we  replace  the  classical  variable 
r  by  the  quantum-mechanical  quantity 

.  (9,11) 

In  addition,  we  use  an  expression  for  the  radiated  energy  which  is 
consistent  with  the  quantum  theory: 


where   the   coefficients    gn  and  gnf  characterize  the  occupancy  of 
states  n  and  n'  by  electrons,  since  according  to  the  Pauli  exclusion 


2 
See  also  Chapter  29,  where  the  coefficient  A  is  obtained  by  the  methods  of  quantum 

electrodynamics. 
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principle  it  is  impossible  for  two  electrons  to  be  in  the  same 
quantum  state  (for  more  on  the  Pauli  exclusion  principle,  see 
Chapter  24). 

Combining  Eqs.  (9.12)  and  (9.1)  and  substituting  (9.11)  we  obtain 

(9.13) 

Let  us  note  that  Eq.  (9.13)  contains  two  averages.  One  is  the 
quantum  -mechanical  average,  denoted  by  a  bar,  and  the  other  is 
the  time  average,  denoted  by  the  subscript  "av." 

We  shall  now  assume  that  the  electron  has  only  two  possible 
states  with  energies  En  and  En>.  Then  the  wave  function  can  be 
written  as 

y(t)=Cne~  »  V<M  Cn>e~  **"''  <[v.  (9.14) 

The  average  (over  the  quantum*  mechanical  states)  value  of  the 
radius  vector  is 

r  =  |  Cn  !'/•„„  +  1  Cn.  |*  rn,n.  +  CJC^-'/w  +  C^Cne^rn^         (9.  15) 
where 


The  matrix  elements 


form  a  certain  infinite  matrix 


(9.17) 


From  (9.16),  it  follows  that  this  matrix  changes  into  its  complex 
conjugate  when  the  rows  are  replaced  by  columns,  and  columns 
by  rows 


Matrices  satisfying  this  condition  are  called  Hermitian  or  self- 
adjoint  matrices.  Let  us  also  emphasize  that  the  matrix  elements 
(9.16)  are  independent  of  time,  and  therefore  substitution  of  (9.15) 
into  (9.13)  yields 

gngn>fK»Ann>  =         ^  \  Cn  |2|  Cn,  |*  |  rn'n  |*.  (9-18) 
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Here   we   have   used   the    fact  that  the  time  average  of  a  periodic 
function  is  zero,  since 


V 

For  further  analys  is  of  Eq.  (9.  18)  ,  we  must  introduce  an  additional 
assumption,  which  can  be  rigorously  justified  only  on  the  basis  of 
quantum  electrodynamics.  As  we  already  know,  quantum  mechanics 
deals  with  stationary  processes  and,  therefore,  there  is  no  am- 
biguity in  interpreting  the  quantity  \Cn\l~  const  as  the  probability 
of  finding  an  electron  in  the  state  n.  When  the  emission  of  radiation 
is  present,  the  coefficients  Cn  change  dis  continuously  and  their 
physical  meaning  cannot  be  simply  explained  within  the  usual 
formalism  cf  quantum  mechanics.  We  shall,  therefore,  base  our 
conclusions  on  simple  physical  considerations,  which  are  rigor- 
ously proved  only  in  quantum  mechanics. 

Let  us  substitute  the  initial  values  for  the  coefficients  CJ  into 
Eq.  (9.18),  bearing  in  mind  that  the  Pauli  exclusion  principle  allows 
transition  only  in  the  case  when  the  quantum  state  n  is  initially 
occupied,  while  the  quantum  state  n'  is  empty.  Then  setting 

8*8n'  =  \Cn\*\Cn.\*  =  \C'n\*(\  —  \C*n\*),  (9.19) 

we    find   that    for    d  =  l    and  C,i'  =  0  the  product  gngn*=  1.  Hence 


Ban.  =  Bn.n  =  *      |  rn.n  \\  (9.21) 

y  '- 


Wm.  =  H»Ann>  =      '--  |  /•„-„  \\  (9.  22) 


In  these  equations 

\rn>nl*  =  \Xn'n\*+\yn>n\*  +  \Zn'n\\  (9.23) 

where 


and  so  forth. 

Thus,  the  energy  eigenvalues  can  be  used  to  find  the  frequency 
of  the  radiation  and  the  eigenfunctions  to  find  its  intensity.  Thus, 
all  basic  classical  radiation  properties  can  be  completely  general- 
ized to  the  quantum  case  by  means  of  the  Schrodinger  equation. 

From  the  last  equation  above  it  is  evident  that  the  intensity  of 
radiation  will  be  different  from  zero  only  for  those  transitions  for 
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which  at  least  one  of  the  matrix  elements  xnn>t  ijnn>  and  znn>  is  non- 
zero. These  transitions  are  called  in  quantum  mechanics  the 
allowed  transitions. 

It  should  be  noted  that  in  very  many  quantum-mechanical 
problems  it  is  sufficient  to  calculate  the  matrix  elements  alone  and 
thus  to  set  up  selection  rules,  thai  is,  to  find  the  changes  in  quantum 
numbers  that  correspond  to  allowed  transitions.  From  a  knowledge 
of  the  selection  rules,  one  can  answer  the  question  of  possible 
frequencies  of  radiation.  In  the  language  of  classical  electro- 
dynamics, the  selection  rules  correspond  to  a  specification  of  the 
harmonics  at  which  radiation  can  be  emitted  by  a  given  system.  If 
the  matrix  elements  for  a  given  change  (difference)  in  quantum 
numbers  are  equal  to  zero,  there  will  be  no  radiation  at  the  corre- 
sponding frequencies,  and  these  transitions  are  said  to  be  forbidden. 

Here,  in  speaking  oi  forbidden  transitions,  we  are  restricting  the 
use  of  this  term  to  electric  dipole  transitions.  By  electric  dipole 
transitions  we  simply  mean  transitions  whose  probability  depends 
on  matrix  elements 


In  addition  to  the  dipole  transitions,  there  are  also  cases  of  quad- 
rupole transitions,  multipole  transitions  of  higher  orders,  and 
magnetic  dipole  transitions.  The  intensity  of  these  transitions  turns 
out  to  be  much  smaller  than  that  of  the  allowed  dipole  transitions. 
As  an  example,  if  the  intensity  of  an  electric  dipole  transition  is  of 
the  order 

w,  2  p2      <*<  (    V2 

w  dipole  =  y  73-  ~  fv  \ea)  » 

where  a  is  the  linear  dimension  of  the  atom,  then  the  intensity  of 
electric  quadrupole  radiation  is  of  the  order 3 

»Vad  ~  ^dipole    -  (f  )*•  <9'24> 

For  an  atom  a~  10  8  cm  and  X~  10  5cmand,  therefore,  the  intensity 
of  dipole  radiation  is  10  times  greater  than  that  of  quadrupole 
radiation.  Nevertheless,  quadrupole  radiation  plays  a  very  important 
part  in  a  number  of  phenomena.  Indeed,  if  the  electric  dipole 
transition  is  forbidden,  it  is  still  possible  that  a  weak  quadrupole 
radiation  will  be  emitted,  which  can  be  detected  with  a  very  sensitive 
spectroscope.  We  note  that  no  dipole  radiation  occurs  in  a  system 
consisting  of  particles  having  the  same  charge  to  mass  ratio.  The 


3This   subject  is  treated  more  fully  for  the  case  of  a  harmonic  oscillator  in  Chapter  10 


[see  Problem  (10.4)]. 
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electric  dipole  moment  of  such  a  system  is  proportional  to  the 
coordinate  of  the  center  of  mass /^-m- [p  — e(rl  +  r2)  =  2^mj,  and 
therefore  the  derivative  of  the  dipole  moment  with  respect  to  time 
vanishes.  This  is  the  situation  that  should  hold  for  gravitational 
radiation,  since  the  gravitational  charge,  or  rather  the  gravitational 
mass,  is  proportional  to  the  inert  mass  mu.  Therefore,  if  gravita- 
tional radiation  does  exist  at  all,  it  can  only  be  of  quadrupole 
character.  Quadrupole  radiation  is  also  of  importance  in  nuclear 
physics  since  the  charged  particles  of  the  nucleus  (protons)  have 
the  same  charge  and  mass.4 

Find  the  probability  of  quadrupole  radiation  in  the  quantum  case  as  a 

classical  formula  by  applying  the  corresponding  principle. 

Solution.    In  the  classical  case  the  intensity  of  quadrupole  radiation  is  given  by  the 
equation 


where  the  quadrupole  moment  is 

Dab  =  e  (Zxaxb  —  r^ab)  (a,  b  =  1,2,3). 

To  generalize  (9.25)  to  the  quantum  case,  it  is  necessary  to  consider  that,  in  quantum 
theory,  radiation  occurs  as  a  result  of  a  transition  of  the  system  from  one  quantum  state 
n  to  another,  /t'.  Following  the  procedure  similar  to  the  derivation  of  Eqs.  (9.13)-(9.18), 
we  first  replace  the  classical  expression  for  the  quadrupole  moment  Dab  by  the  matrix 
element 

(Dn'n)ab= 

Next,  using  Eq.  (9.3),  which  relates  the  intensity  of  radiation  Wn>n  to  the  emission 
probability  An,nt  we  obtain 

Ann>  =  ^"^  (/Waft  Wn'Jab,  (9.26) 

where  the  frequency  of  radiation  o>  is  given  by  Eq.  (9. 15a). 

s—~~~~*\ 

^Problem  9. 2.  i  Find  the  selection  rules  for  dipole  and  quadrupole  radiation  for  a  parade 
in  anmniffWtytreep  potential  well. 

Answer.  For  dipole  radiation  A/I  must  be  an  odd  number,  and  for  quadrupole  radia- 
tion A  n  is  an  even  number. 

Hint.  Using  the  wave  function  (4.32),  it  can  be  shown  that  the  average  value  of  the  x 
coordinate  is 


Therefore,  the  matrix  elements   corresponding   to  dipole  (/=  1)  and  quadrupole  (j  =  2) 
radiation  should  be  calculated  from  the  equation 


4  See  W    Hcitler,   The  Quantum  Theory  of  Radiation,   3rd  Ed.,  Now  York:  Oxford  Univer- 
sity Press,  1954 


Chapter  10 

The  Linear  Harmonic  Oscillatoi 


The  problem  of  the  linear  harmonic  oscillator  is  one  of  the  most 
important  problems  of  theoretical  physics.  It  arises  in  connection 
with  the  construction  of  a  simple  thoery  of  vibrations  in  many  dif- 
ferent branches  of  physics  (mechanics,  classical  electrodynamics, 
radio  physics,  optics,  atomic  physics,  and  so  on).  Most  of  the  new 
theories  developed  in  atomic  physics  have  been  "tested"  in  a 
number  of  simple  problems,  including,  in  particular,  the  problem 
of  the  harmonic  oscillator. 

The  motion  of  a  complex  system  can  often  be  analyzed  in  terms 
of  the  normal  modes  of  vibrations  that  are  equivalent  to  the  vibra- 
tions of  harmonic  oscillators.  For  us  the  harmonic  oscillator 
presents  additional  interest  for  methodological  reasons  because  this 
problem  has  an  exact  solution  and  thus  provides  a  simple  illustra- 
tion of  an  application  of  the  Schrodinger  equation.  The  harmonic 
oscillator  problem  was  also  very  useful  in  the  development  of 
quantum  field  theory  (second  quantization)  and  in  the  analysis  of 
the  zero-point  energy  of  the  electromagnetic  field  (vacuum  fluctua- 
tions). In  quantum  mechanics,  the  harmonic  oscillator  problem  has 
direct  applications  in  regard  to  the  black-body  radiation,  molecular 
spectra,  and  the  specific  heats  of  diatomic  molecules. 

A.  THE  OSCILLATOR  IN  THE  BOHR  THEORY 

We  shall  first  consider  the  classical  theory  of  the  harmonic 
oscillator.1  Let  us  assume  that  a  certain  particle  of  mass  rnQ  is 
acted  upon  by  an  elastic  force 

F=-kx,  (10.1) 

where  k  is  the  elastic  constant.  Then  the  equation  of  motion  for  the 
harmonic  oscillator  can  be  written  in  the  form 

m,x=—kxt  (10.2) 

which  describes  an  ordinary  oscillatory  process. 


*In  this  section  we  shall  be  concerned  with  the  case  of  one-dimensional  motion  and,  for 
simplicity,  we  shall  use  the  term  "harmonic  oscillator*  rather  than  "linear  harmonic  oscil- 
lator." 
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The  solution  of  this  equation  has  the  form 

(10.3) 


where  (o  =  -*-  =  l/  —    is  the  angular  frequency  and  a  is  the  ampli- 

I  f  ttlQ 

tude  of  oscillation.  From  Eq.  (10.3),  it  follows,  in  particular,  that 
the  acceleration 

w  =  x=  —  ao/2  cos  co/  (10.4) 

differs  from  zero  and,  consequently,  that  the  oscillation  of  a  charged 
particle  will  be  accompanied  by  radiation,  the  intensity  of  which 
(that  is,  the  radiant  energy)  is  given  by  the  following  equation  in 
accordance  with  Eqs.  (10.4)  and  (2.2): 

w/  1e-  ,  ...  2          £2flso>4  /-i  n  [-  v 

Wcl  =  -3-3  Way  =  -3-T-  •  (l  °-5  > 

In  deriving  (10.5),  we  calculated  the  average  value  of  cosV  from 
the  equation 

T 

Y.  (10.6) 

We  shall  now  express  the  intensity  of  radiation  Wc\  in  terms  of  the 
total  energy  E  =  T-{-V  of  the  harmonic  oscillator.  From  the  well- 
known  equations  for  the  potential  energy 


(10.7) 

0 

and  kinetic  energy 

T  =  -?2—  =  '"" "*a*-  sin  V  (10.8) 

of  a  harmonic  oscillator,  we  find 

(10.9) 


With   this   equation,  we  can  eliminate  the  quantity  a1  from  (10.5), 
obtaining 


Thus,  on  the  basis  of  classical  theory  one  can  determine  both 
the  intensity  and  frequency  of  the  radiation;  it  is  also  found  that  this 


THE     LINEAR    HARMONIC    OSCILLATOR  151 

frequency  is  the  same  as  the  frequency  of  mechanical  vibrations  of 
the  harmonic  oscillator.  The  energy  of  the  harmonic  oscillator, 
according  to  the  classical  theory,  can  have  any  value  in  a  continuous 
range  from  zero  to  infinity. 

Several  new  features  were  introduced  in  the  problem  of  the 
harmonic  oscillator  by  the  Bohr  quantum  theory.  For  example, 
according  to  Bohr's  theory,  the  energy  levels  had  to.  be  discrete 
and  could  be  found  from  the  quantization  rule 

jx  =  2xfin,  (10.11) 

where 

pA  =  g=Wue.  (10.12) 

Let  us  substitute  p^dx  =  m^r-~  dt  =  m()coV  siii*«>M/  into  Eq.   (10.11). 

Then,  taking  into  account  Eq.  (10.9)  and  integrating  over  a  complete 
period,  we  find 

En  =  nhv>,  (10.13) 

where  the  quantum  number  «  =  0,  1,  2,  3, ...  . 

We  showed  above  that,  according  to  Bohr's  theory,  the  energy 
of  a  harmonic  oscillator  can  take  only  discrete  values,  and  radiation 
will  be  emitted  only  when  the  oscillator  makes  a  transition  from 
one  energy  level  to  another. 

The  discovery  of  a  discrete  spectrum  of  energy  levels  of  a 
harmonic  oscillator  played  an  important  part  in  the  theory  of 
black-body  radiation.  Planck's  law  was  first  obtained  under  the 
assumption  that  the  harmonic  oscillator  could  radiate  and  absorb 
light  only  in  the  form  of  discrete  quanta  of  energy  /h». 


B.    EIGEN  FUNCTIONS  AND  EIGENVALUES 
OF  THE   ENERGY 

In  order  to  determine  the  behavior  of  the  wave  function  in  the 
harmonic  oscillator  problem,  let  us  first  give  a  graphical  repre- 
sentation (Fig.  10.1)  of  the  dependence  of  the  potential  energy  V 
on  x 


i/  --. 


From  the  graph,  it  is  seen  that  inside  the  potential  well,  where  the 
total  energy  E  of  the  harmonic  oscillator  is  greater  than  V  (E  >  V), 
the  solutions  for  <f  will  take  the  form  of  harmonic  functions.  Inside 
the  potential  barrier  (£<V) ,  the  solutions  will  contain  two  parts, 
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one  exponentially  decreasing  and  the  other  exponentially  increasing 
(see  Fig.  10.1).  It  is  clear  that  the  solution  of  the  problem  reduces 
to  finding  the  conditions  under  which  there  is  no  exponentially 
increasing  solution.  Just  as  in  the  case  of  a  rectangular  potential 
well  with  infinitely  high  walls  (Chapter  4),  such  levels  exist  only 
at  certain  discrete  values  of  the  energy,  which  we  must  determine. 


Fig.    10.1.     Wave    function    of    the    harmonic    oscillator    for    an 
arbitrary  value  of  the  energy. 

Since  the  potential  energy  V   of  a  harmonic  oscillator  depends 
only  on  the  x  coordinate,  the  Schrbdinger  equation  can  be  written  as 


(10.14) 


Setting 

and  introducing  a  new  variable 

we  obtain 
where 


B  =  -1-  =  — °- 
,\'n          H 


(10.15) 
(10.16) 

(10.17) 


First,  let  us  find  the  asymptotic  behavior  of  the  wave  function 
at  ;—  »  co,  that  is,  when  the  constant  X  is  negligible  in  comparison 
with£2.  Then 

*«  —  £k%  =  0.  (10.18) 
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We  shall  seek  a  solution  of  this  equation  in  the  form 

*w  =  ^'.  (10-19) 

Since 

^=  (4e*P  +  2e)  e**  ^  4e2EV'9, 
we  find 


(10.20) 
and,  consequently, 


Since  the  wave  function  must  remain  finite  at  £—  vr*:oo,  coefficient 
Ci  must  be  set  equal  to  zero.  Coefficient  C4  can  be  taken  to  be  equal 
to  unity,  since  the  wave  function  has  not  yet  been  normalized.  Thus, 
the  asymptotic  behavior  of  the  wave  function  <|i  is  described  by  the 
function 

y00  =  c-i'&.  (10.21a) 

We  shall  seek  a  solution  of  the  wave  function  in  the  general  form 

ty  =  ty00u  =  e-l'*'u,  (10.22) 

which  already  takes  in  account  the  behavior  at  infinity.  Substituting 
(10.22)  into  (10.16)  and  considering  that 

(e-li&  it)"  =  [//'  —  2;w'  +  (P  —  1  )  u]  e~1'**, 
we  obtain  the  following  equations  for  u  : 

K"-2te'  +  (X_l)tt  =  0.  (10.23) 

Let  us  look  for  a  solution  of  this  equation  in  the  form  of  a  series 


Substituting   this  expression  for  u  into  Eq.   (10.23)  and  collecting 
terms  with  the  same  power  of  6  ,  we  find 


Equating    the   coefficients   of    5*    to    zero,   we   obtain  a  recursion 
formula  for  the  coefficients  bk 


This  formula  relates  the  coefficients  bk  to  6A+9,  and,  therefore,  the 
series  (10.24)  will  consist  of  even  powers  (if  the  minimum  subscript 
k  is  even)  or  odd  powers  (if  the  minimum  subscript  k  is  odd). 
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If  the  series  (10,24)  does  not  terminate  at  a  certain  maximum 
power,  then  beginning  with  /z>-^-,  every  term  is  positive  and, 

consequently,  the  series  diverges  for  large  values  of ;  .  This  leads 
to  the  second  asymptotic  solution  <tasym  ^el/i^  at  ;  —  co  ,  which  we 
disregarded  earlier  because  it  diverges.2  Therefore,  in  order  for 
the  boundary  conditions  to  be  satisfied  (<);  — Oat  £  — <.o),  we  must 
terminate  the  series  (10.24)  at  a  certain  km&K==n.  We  thus  require 


*W  =  0.  (10.26) 

From  (10.26)  and  (10.25),  we  find 

X  =  2n+l,  (10-27> 

and,  consequently, 

£n==/ho(/z  +  -J  (10.28) 

where  n  can  assume  any  positive  integral  value,  including  zero. 
These  are  the  only  energy  values  for  which  the  wave  function 
vanishes  at  infinity. 

Comparing  this  expression  with  the  one  obtained  from  the  Bohr 
theory  [see  (10.13)],  we  note  the  appearance  of  a  term  called  the 
zero -point  energy 

E,  =  ~  A<D.  (10.29) 

Later,  we  shall  show  that  the  existence  of  the  zero- point  energy 
is  related  to  the  uncertainty  principle  and  thus  to  the  wave  properties 
of  particles.  The  zero-point  energy  does  not  affect  the  frequency 
of  the  radiation,  however,  since  it  cancels  out  in  the  expression 

for  the  frequency  o>nn»  =  — /l--^— n-  . 

Let  us  now  find  the  wave  function  of  the  harmonic  oscillator. 
The  recursion  formula  (10.25)  for  the  coefficients  bk- 
takes  the  form 


2This  follows  from  the  fact  that  at  large  k  the  ratio  of  the  coefficients 
fe/2  is  the  same  as  for  the  series  expansion  of  the  function  e 


Therefore, 
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where  /e</i  .   Setting  the  coefficient  of  the  highest  power  Armax  =--•=/« 
equal  to3 

bn  =  2*9  (10.30) 

we  obtain 


The  power  series  with  a  finite  number  of  terms  obtained  for  the 
function  u  is  called  the  Hermite  polynomial 

u  -  Hn  (5)  -  (2  Q«  -  ^fl>  (25)-*  + 


i  .."(^~1.Hyl~2)(/l~3)  /2£V»-4  -  1-  .  .  .  -f  I  b£  for  odd    n         (10.32) 

1    &0for  even  n 

In  particular, 

ffo  (*)=!,     Wi(5)  =  25,     W4  (?)  =  «*  -2, 

//3(c)  =  8E3—  125  (10.33) 

The  Hermite  polynomials  Hn(l)  can  be  written  in  closed  form4 

//,)=-i^.  do.34) 


This   coefficient  can  always  be  chosen  arbitrarily,    since  the  normalization  factor  of 
the  wave  function  i//     is  still  undetermined 

To    show    this,    we    introduce    the    function    v  =  c     *   ,  which    satisfies    the    equation 

v    \-  2fv    --    0 
Differentiating  this  equation    n  +  1    times,  and  using  the  Leibnitz  formula 


we  obtain 


Making  the  substitution 


we  find  that  the  function  w  satisfies  Eq.  (10  35),  and  thus  it  is  proportional  to  the  Hermite 
polynomial 


The  proportionality  factor  An  can  be  found  by  equating  the  coefficients  of  £2n    As  a 
result,   it  is  found  that  An  -  (-l)n,  from  which  we  obtain  Eq    (10  34). 


156  NONRELATIVISTIC    QUANTUM    MECHANICS 

From    (10.32)   it  is  clear  that  //„(;)  satisfies  Eq.  (10.23)  pro- 
vided A  =  2/i-f-l 

Q  (10.35) 


According  to  (10.22)  and  (10.32),  the  solution  of  the  Schrbdinger 
equation  for  a  harmonic  oscillator  is 

tyn  =  Cne  l/**Hn(l)>  (10.36) 

where  5  is  related  to  the  coordinate  x  by  Eq.  (10.15).  The  coef- 
ficient Cn  can  be  determined  from  the  normalization  condition 

-Ho  -|^cn 

^  ^;i(/v  =  .v0Cl  ij  e-^Htt  (;)  //„  (?)  </;  =  1.  (10.37) 

—00  -03 

Substituting  the  closed  form  (10.34)  for  one  of  the  polynomials 
//„(;),  we  obtain 

(-lyjCoCS  f  7/n  (5)  -rf^ a  dE  —  1.  (10.38) 

— 00 

Using  the  rule  for  transferring  the  derivative  of  one  function  to 
another  [see  (7.17)]  (that  is,  we  integrate  by  parts  n  times),  we 
obtain 

"f"° 

r~4*dn%n®<k=  1-  (10.39) 


Noting  that  from  (10.32) 

~//n(6)  =  2"/z!  (10.40) 

i*i 

and 

H^co  ^ 

^e-^=  \\,  (10.41) 

— 00 

we  find 


V  2n/l!    |/'W    A' 

that  is, 


ao.42) 


THE     LINEAR    HARMONIC    OSCILLATOR 


157 


In  a  similar  manner,  we  can  easily  prove  the  orthogonality  condition 
for  the  wave  functions.  To  do  this,  it  is  enough  to  represent  one  of 
the  Hermite  polynomials,  specifically,  the  one  with  larger  n,  in 
closed  form  (10.34).  The  orthogonality  condition  also  follows  from 
the  general  investigation  of  the  Schrbdinger  equation;  it  can  be 
proved  that  the  eigenfunctions  corresponding  to  the  different  eigen- 
values are  orthogonal. 


Fig.  10.2.  The  energy  eigenvalues  and  the 
behavior  of  the  corresponding  eigenfunctions 
of  the  harmonic  oscillator  for  small  quantum 
numbers  (n  —  0,  1,  2)  For  comparison,  the 
classical  probability  distribution  functions 
pn  are  indicated  by  the  dotted  lines 


In  the  case  of  a  harmonic  oscillator,  the  orthonormality  condi- 
tion is 

+00 

\     tyn'tyndx  =  &/m'.  (10.43) 

—oo 

For  small  quantum  numbers /i  =  0,  1,  2,  .. .  ,  when 


-, 


(10.44) 
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the  probability  distribution  functions  \tyn\*  (see  Fig.  10.2)  differ 
considerably  from  the  corresponding  classical  probability  functions. 
In  the  classical  case  the  probability  of  a  particle  being  at  a  certain 
point  is  proportional  to  the  amount  of  time  the  particle  spends  there, 

and,  consequently,  is  inversely  pro- 
portional to  the  particle  velocity; 
therefore,  the  classical  probability 
is  proportional  to5  (a2— r2)-1^,  where 
a  is  the  maximum  displacement  of 
a  classical  oscillator  from  the  equi- 
librium position.  As  we  would  ex- 
pect, it  is  only  for  large  quantum 
"  numbers  that  there  is  a  relatively 
close  agreement,  on  the  average, 
between  the  quantum  and  classical 
probabilities  (see  Fig.  10.3). 


Fig.   10.3.    Comparison    of   quantum 
and  classical  results  for  the  oscil- 
lator in  the  region  of  large  quantum 
numbers  (here  n  -  10). 


C.    ZERO- POINT  ENERGY  OF  THE 

HARMONIC  OSCILLATOR  AND 
THE  UNCERTAINTY  PRINCIPLE 


We  have    seen   that  in  quantum 
mechanics  the  minimum  energy  of 

the  harmonic  oscillator  is  given  by  Eq.  (10.29)  and  cannot  go  to 
zero,  whereas  in  the  classical  theory  or  the  Bohr  theory,  the 
minimum  energy  is  equal  to  zero. 

We  shall  now  show  that  the  existence  of  the  zero- point  energy 

(10.29)    in   the    Schrjjdin^er  Jtheory j[g^as  mentioned  above,  very 

to  thq  unQprtflfnty  principle  (8.11).    For  the  "case 


(10.45) 


of  a  harmonic  oscillator  the  uncertainty  principle  becomes 


*« 


'  This  can  be  easily  shown  on  the  basis  of  the  following  simple  qualitative  consider- 
ations. The  probability  of  finding  a  particle  at  a  particular  point  can  be  roughly  charac- 
terized by  the  absolute  value  of  the  reciprocal  of  its  velocity,  since  the  time  a  particle 
spends  in  a  region  will  be  greater  in  regions  where  the  velocity  is  smaller  than  in  regions 
where  it  is  greater.  Consequently,  the  probability  of  finding  a  particle  in  a  region  with 
larger  velocities  will  be  smaller  than  that  for  a  region  with  smaller  velocities.  In  the 
case  of  a  harmonic  oscillator,  we  have  from  Eq.  (10.3)  x/a  -  cos  cot  and  t/eaa  -  —  sin  a>t. 
Taking  the  square  of  both  equations  and  adding  them,  we  get 

1 


and,  therefore, 
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Here  we  have  replaced  (AT)2  by  'x*  and  (A/7y2  by  ~p\  This  is  justified 
by  the  fact  that  the  wave  functions  are  real  and  are  either  even  or 
odd.  Indeed,  since  the  expression  fy*xty  =  xfy*  is  odd,  we  have 

x==  f  <|> **<!*/*  =  0. 
Hence 


Similarly,  using  the  boundary  conditions  at  infinity,  we  find 


n 


*"=<>, 

—00 


that  is, 


Substituting  the  value  of  /?  from  (10.45)  into  the  equation  for 
the  total  energy 

£  =  tf=Jl  +  ^(  (10.46) 

we  obtain 

^T^  +  -^-  <10'47> 

From  this  it  is  seen  that  the  energy  E  cannot  vanish  at_any  value 
of  (*4).  Indeed,  although  the  second  term  vanishes  for  (xr)  =  Q,  the 
first  term  becomes  infinite.  Conversely,  when  (**)==  oo,  the  first 
term  vanishes  and  the  second  becomes  infinite. 

Thus,  the  fact  that  £mln  differs  from  zero  is  directly  connected 
with  the  uncertainty  relation  (10.45)  or,  in  other  words,  with  the 
fact  that  it  is  impossible  to  calculate  exactly  the  position  and 
momentum  simultaneously. 

Let  us  find  that  value  of  (x2)  at  which  Eq.  (10.47)  has  a  mini- 
mum. Setting  the  derivative  of  this  function  with  respect  to  (P) 
equal  to  zero,  we  obtain 


or 
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Substituting  this  into  (10.47),  we  have 

E^~-\-~  =  —  .  (10.48) 

Hence,  £min=Y»  which  is  exactly  the  same  as  the  value  for  £"0  found 

from  the  wave  theory  [see  (10.29)]. 

The  existence  of  a  finite  zero-point  energy  of  the  harmonic 
oscillator  is  one  of  the  most  characteristic  manifestations  of  the 
wave  properties  of  particles.  Thus,  the  experimental  verification 
3f  the  zero-point  vibrations  was  of  great  significance  for  quantum 
mechanics.  The  zero-point  energy  £0  was  first  observed  experi- 
mentally in  the  scattering  of  x-rays  by  crystals  at  low  temperatures. 
If  there  were  no  lattice  vibrations  at  low  temperatures  (£0  =  0),  as 
predicted,  for  example,  by  the  Bohr  theory,  there  would  be  no  inter- 
action between  the  x-rays  and  the  crystal  lattice,  and  consequently 
no  scattering  would  occur.  If,  on  the  other  hand,  the  minimum 
energy  were  different  from  zero  (£0  ~f-  0)  for  T-^0,  the  scattering 
cross  section  at  low  temperatures  should  approach  a  finite  limit. 
Experiments  have  confirmed  that  the  second  situation  corresponds 
to  the  true  state  of  affairs  and,  therefore,  the  conclusions  of  the 
Schrbdinger  wave  theory  are  justified. 


D.    SELECTION  RULES.    INTENSITY  OF  RADIATION 

Let  us  consider  the  problem  of  radiation  from  the  harmonic 
oscillator  on  the  basis  of  wave  mechanics.  For  this  purpose,  as 
was  indicated  in  Chapter  9  [see  (9.22)],  we  must  calculate  the 
matrix  elements 

Xn,n=  J  ^JC-MJC,  (10.49) 

where  <^is  given  by  Eq.  (10.36). 

As  a  preliminary  step,  we  shall  derive  a  recurrence  relation 
for  the  Hermite  polynomials,  which  will  be  necessary  in  our  further 
discussion.  From  the  definition  (10.32)  of  the  Hermite  polynomials 
Hn  (£),  we  find 

Hn  (S)  =2n 


from  which  it  follows  that 

,(£).  (10.50) 
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Next,  substituting  these  equations  for  the  derivatives  into  Eq. 
(10.35)  and  replacing  n-+nr  -\-  1,  we  obtain  a  recurrence  relation  for 
the  Hermite  polynomials 

Wn,(\)=n'Hn,-^)  +  ±Hn>+\  (£).  (10.51) 

By  means  of  this  formula,  the  matrix  element  (10.49)  can  be  reduced 
to  the  form 

4  <x>  -foo 


=  xlCnCn>{±    J  ^// 


In  terms  of  the  wave  functions  <|>,  we  obtain 


V+l*ndX+nrg  J    ^l 


Since  the  functions  <l>nare  orthonormal,  we  have 

(10.53) 


From  this  expression  it  follows  that  the  only  nonvanishing  matrix 
elements  are  those  for  which  n'  =  n—\  or  n'  =  n-}-\\  therefore, 
the  selection  rules  for  the  quantum  number  n  are  expressed  by  the 
equation 

A/i  =  /i  — /if  =  -±l,  (10.54) 

which  indicates  that  only  transitions  between  neighboring  levels  are 
possible. 

For  the  wave  function  as  given  by  Eq.  (10.42),  we  obtain  from 
Eq.  (10.53) 


xn-\,n~xb  Y     V» 

.  (10.55) 


where  x(}  is  determined  from  Eq.  (10.15).  For  the  frequency  of  the 
radiation,  we  obtain  exactly  the  same  expression  as  in  the  classical 
theory: 


n.n-l — ' 


(10.56) 


162  NONRELAT1V1STIC    QUANTUM    MECHANICS 

The   energy   levels  and  allowed  transitions  are  shown  in  Fig. 
10.4. 

Since  spontaneous  emission  is  possible  only  when  transitions 
occur  from  higher  to  lower  energy  lev  els  (En^>En.i)tit  follows  from 

(9.22)  that  the  intensity  of  radiation 
of  the  harmonic  oscillator  W  =  Wn  n  l 
is 


2 


co  '  ""•  *       3  w0c3 

c    ,t  o   -,..,      E^_£^  (1(h5?) 


E0/h  Comparing  this  equation  with  Eq. 

(10.10),  which  was  obtained  from  the 

l''ig.  10  4.  Allowed  transitions  of  the  classical     theory,     W6     S66     that    for 

harmonic  oscillator  large  quantum  numbers  (n f*>\)9  when 

En  I  »  £<„  both  equations  yield  practic- 
ally the  same  result.  Transitions  to  higher  energy  levels  n-+n+\ 
are  possible  in  the  case  of  induced  transitions.  The  occurrence  of 
spontaneous  upward  transitions  is  also  possible  under  the  condition 
that  the  energy  loss  in  the  harmonic  oscillator  is  compensated  by 
the  simultaneous  liberation  of  a  large  amount  of  energy,  as  for 
example,  in  transitions  of  atomic  electrons  (see  Chapter  12, 
spectra  of  diatomic  molecules). 

KLjf  Find  the  eigenvalues  of  the  harmonic  oscillator  using  the  WKB  method. 

lution.  According  to  (10.14),  the  wave  function  for  the  harmonic  oscillator  is 


where 


According  to  Eq.  (5.75),  the  eigenvalues  are  determined  from  the  equation 


Evaluaung  this  integral,  we  find  that  the  energy  eigenvalues  are  the  same  as  in  (10.28); 
that  is,  the  zero-point  energy  is  also  present. 


^jr oblem  10. 2,1  Construct  the  theory  of  the  harmonic  oscillator  in  the  p  representation 
(forlne  one-dimensional  case).  Find  the  equation  of  motion,  the  eigenvalues  and  the 
eigenf  unctions. 

(]2 

Solution.    Since  x2  =  —  n2  ---  in  the  p  representation,  we  can  write  the  Schrodlnger 
equation  as 
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that  is,  transforming  the  wave  equation  for  the  harmonic  oscillator  from  the  A"  representa- 
tion to  the  p  representation  and  introducing  the  new  parameters 


where 

Po  =  Ymolitot 

we  find  that  the  wave  equation  changes  identically  into  itself 

*"  +  (*-  19)  ?  =  0 

(the   prime  indicates  the  derivative  with  respect  to  >)).  Using  the  solutions  (10.28)  and 
(10.42),  we  can  write  in  the  p  representation 


and 

--(P-V 

0     \  n     ] 

""\Po 
It  is  easily  verified  that  this  wave  function  satisfies  the  normalization  condition 

\fn(p)\idp^l. 

\Problem  10.3jFind  the  eigenfunctions  and  energy  spectrum  of  an  electron  (e  =  —  e<>  <  0) 
LOvtTl^  irweOflsiant,  uniform  magnetic  field.  Show  that,  according  to  the  quantum  theory, 
the  "electron  gas"  must  be  diamagnetic. 

Solution.  Let  the  magnetic  field  be  directed  along  the  z  axis  (A/ v  =  ffy=  $Jl£=z&W). 
We  can  then  write  for  the  components  of  the  vector  potential  Av~X(£f,Av  =  A2  =  0. 
The  motion  of  an  electron  is  described  by  the  Schrodinger  equation  (see  also  Problem  8.4) 

2ifTQ 

Since  the  coordinates  y  and  z  do  not  appear  explicitly  in  this  equation,  we  shall  look 
for  a  solution  in  the  form 

L 
For  the  function/^),  we  obtain  the  equation 


where 

Wl  cfik, 


It  is   easily   seen  that  this  equation  has  the  same  form  as  Eq.  (10.14)  for  the  harmonic 
oscillator. 

Consequently,  we  can  use  solutions  (10.27)  and  (10.36)  to  determine  the  eigenfunctions 
and  eigenvalues.  We  thus  find 
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where  Hn  (;)  is  the  Hermite  polynomial,  Cn  =  I  —  >an/y!°Q.va        *  is  the  normalization  < 

I  nnt    (n\)    j 

efficient,  o  =   °      -is  the  frequency  of  Larmor  precession,  and 


For  the  eigenvalues,  we  have 


The  last  term  in  this  equation  is  simply  the  kinetic  energy  of  a  free  electron  moving 
along  the  z  axis,  and  is  of  no  special  interest. 
The  first  term 

£n  =  fV^T  (2/1+1),  (10.59) 

where  fi0  is  the  Bohr  magneton,  corresponds  to  the  additional  energy  acquired  by  an 
electron  in  the  magnetic  field.  This  additional  term  represents  the  energy  of  electron 
motion  in  the  xy  plane,  which  is  perpendicular  to  the  magnetic  field. 

This  conclusion  is  in  agreement  with  classical  theory,  according  to  which  an  electron 
placed  in  a  magnetic  field  precesses  with  the  Larmor  frequency  o  in  a  plane  perpen- 
dicular to  the  magnetic  field. 

In  the  classical  theory,  however,  the  energy  of  an  electron  in  a  magnetic  field  is 
determined  entirely  by  its  unquantized  kinetic  energy.  Therefore,  according  to  the  clas- 
sical theory,  an  electron  gas  generally  exhibits  no  diamagnetic  properties. 

In  the  quantum  theory  the  energy  (10.59)  can  be  interpreted  in  terms  of  the  appearance 
of  an  additional  magnetic  moment  ji  of  an  electron,  which  makes  the  following  contribu- 
tion to  the  energy: 


(  10.60) 
Comparing  (10.60)  with  (10.59),  we  find 

f^  =  —  M2/I+1). 

Since  the  number  In  +  I  assumes  only  positive  values  (n  =0,  1,  2,  3,  .  .  .),  the  addi- 
tional moment  of  an  electron  will  be  directed  along  the  negative  z  axis.  This  naturally 
leads  to  the  diamagnetism  of  free  electrons  in  a  metal. 

It  should  be  noted  that  in  quantum  mechanics,  solution  (10.58)  corresponds  to  harmonic 
vibrations  along  the  z  axis  along,  whereas  in  classical  theory  the  circular  trajectory 
means  that  there  are  harmonic  vibrations  along  both  x  andy  axes,  with  a  phase  difference 

of-^r-.   The  reason  for  this  is  that  the  energy  is  independent  of  the  momentum  fik2.  Con- 
sequently, degeneracy  occurs  and  the  solution  for  a  given  energy  has  the  form 

(10.61) 

where  the  coefficients  Ck  are  arbitrary  amplitudes  satisfying  the  normalization  condition 


In  classical  theory  there  is  also  an  indeterminacy,  since  the  center  of  the  circular  trajec- 
tory is  not  uniquely  specified. 

The  general   solution  (10.61)  corresponds   to  a  set  of  circular  trajectories  having 
different  centers  located  along  the  y  axis. 
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Concluding  the  above  discussion,  let  us  note  that  the  solution  (10.61;  includes  the 
harmonic  oscillations  along  both  x  and  y  axes.  This  can  be  seen  by  examining  the  expres- 
sion for  the  energy  [Eq.  (10.59)]  which  "represents  a  sum  of  the  energies  of  two  harmonic 
oscillators  (note  that 


ilem  1QA  Show  that  the  matrix  element  of  the  product  of  two  operators  M  (A)  and 
N  (A),  which  fife"'  independent  of  quantum  numbers,  is  equal  to  the  sum  of  the  products  of 
the  matrix  elements  of  these  operators,  that  is, 


(10.62) 

k 

Solution.  Writing 

)^  =         <[£  (.v)  M  (x)  I  (x  -  A')  N  (*')  tyn  (A-')  dx  dx' 


and  using  the  relation  (see  Chapter  4) 


together  with  the  fact  that  the  operators  M  (,v)  and  N  (A-')  are  independent  of  quantum  num- 
bers, we  readily  prove  Eq.  (10.62). 


feroblem^  10.5J  Find  the  selection  rules  for  quadrupole  radiation  emitted  by  the 
harmOnlc"  oscillator.  Find  the  intensity  of  spontaneous  quadrupole  radiation  and  compare 
it  with  the  intensity  of  dipole  radiation.  Obtain  Eq.  (9.24),  which  relates  the  intensity 
of  quadrupole  radiation  to  that  of  dipole  radiation, 

Solution.  In  Problem  9.1  we  found  that  the  quadrupole  radiation  is  proportional 
to  the  matrix  element,  which  according  the  the  preceding  problem  can  be  written  as 


Substituting  the  values  of  xn,k  from  Eq.  (10.55),  we  find  the  following  nonvanishing  matrix 
elements  of  the  quadrupole  radiation: 


(10.63) 

That  is,  the  selection  rules  for  the  quadrupole  radiation  of  the  oscillator  are 

An  =  0,  it  2. 

The  probability  of  spontaneous  emission  (n'  =  n  —  2,   An  =  2)  is  calculated  from  Eq. 
(9.26).  For  large  quantum  numbers  when  E  ==£  nh<*t  we  have 

16  L2^2 
15    mfc6"' 

According  to  (10.57),  the  intensity  of  dipole  radiation  is 
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Using  the  last  two  equations,  we  obtain 


2E 

where  a9  = 3  is  the  square  of  the  classical  amplitude  of  oscillations.  This  relation  is 

//I0<oa 

in  agreement  with  Eq.  (9.24). 


^Problem  10.6J  Show  that  the  center  of  a  wave  packet  composed  of  the  solutions  for 
the  nUllllUUft!  UUcTllator  moves  according  to  the  laws  of  classical  mechanics. 

Show  that  this  wave  packet  does  not  spread  with  time.  Obtain  the  transition  to  the 
quasi-classical  case  (n  >•  v  ;>  lf  where  2v  +  Us  the  number  of  waves  in  the  wave  packet). 

Solution.  Let  us  assume,  for  simplicity,  that  the  wave  packet  is  composed  of  2v  -f- 1 
eigenf unctions  of  equal  amplitudes 


where    tyn+j  are  the  eigenfunctions  of  the  harmonic  oscillator,  and  w  is  the  mechanical 
frequency  of  vibration. 

The  coordinate  x  of  the  center  of  the  wave  packet  is  given  by 


with 


2v 


where  a  =  T/   ^(2/l+  O.is  the  classical  amplitude  of  oscillations. 

f  Wo4* 

It  follows  from  the  last  two  equations  that  A:  obeys  the  classical  equation  of  motion 
for  the  harmonic  oscillator 


Evaluating  x*  in  a  similar  manner,  we  obtain  (A*)2  =  xa  —  Jt8  for  the  mean-square 
deviation 

(Kx)*  =  f  C2  — -~W(  B*—  —  }cos2c< 
where 


2  '  2"  27+~ 


Consequently,  (A A:)*  oscillates  about  a  certain  mean  value  and,  therefore,  does  not  spread 
with  time. 
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In  the  quasi-classical  case,  we  have 

1 


2v+i;> 
and  thus 


x  c«  a  cos  «£,    (Ax)8  c^s =s.  const. 

2v-j-  I 


From  this  it  follows  that  the  larger  the  number  of  waves^the  smaller  will  be  the  width 
of  the  packet,  and  finally,  for  (2v  -f  1)  >  1  f  the  width  tends  to  zero. 


Chapter  1  1 

General  Theory  of  Motion  of  a   Particle   in 
a   Centrally  Symmetric  Field 


The  problem  of  the  motion  of  a  particle  in  a  central  field  of 
force  (a  field  in  which  the  potential  depends  on  the  distance  alone, 
and  not  on  the  angles)  is  one  of  the  standard  problems  of  quantum 
mechanics.  This  problem  provides  a  basis  for  the  quantum  theory 
of  the  rotator,  which  is  of  considerable  importance  in  connection 
with  the  spectra  of  diatomic  molecules,  the  theory  of  the  hydrogen 
atom,  the  nonrelativistic  theory  of  the  deuteron,  and  so  on.  It  is 
worth  noting  that  in  a  central  field  of  force  the  dependence  of  the 
wave  function  on  the  angles  U  and  <p  is  completely  unrelated  to  the 
specific  form  of  the  potential  energy.  Accordingly,  the  spherical 
harmonics  are  of  general  validity;  they  are  applicable  to  any 
centrally  symmetric  field.  The  classical  analog  of  the  quantum- 
mechanical  investigation  of  the  angular  parts  of  the  wave  function 
is  the  derivation  of  the  law  of  conservation  of  angular  momentum 
in  a  central  field  of  force.  This  law  is  also  independent  of  the 
specific  form  of  the  potential  energy. 


A.     SCHRODINGER'S  EQUATION  IN  SPHERICAL 
COORDINATES 

The  problem   of   the  motion  of  a  particle  in  a  central  field  of 
force 

F=F(r)$,  (H.l) 

is  usually  solved  in  the  spherical  coordinates  r,  §  and  cp,  which  are 
related  to  the  Cartesian  coordinates  (see  Fig.  ll.l)by  the  equations 


Jt  =  pcos<p,  r/=psincp,  z  =  rcosfr,  p=rsinO.  (H.2) 

We  shall  now  write  the  Schrodinger  equation  in  spherical  co- 
ordinates. 

First,  using  the  general  definition  of  the  potential  energy  V  as 
a  quantity  whose  negative  gradient  is  equal  to  the  force  /%  we  have 

dV  =  —(F-dr).  (11.3) 
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For  the  case  of  central  forces  (11,1),  we  obtain 
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and  hence 


dV  =  —  -~(xdx  +  ydy  +  zdz)  =  —  Fdr, 

r 

V(r)  =  -  J  F(r)dr, 


(ll.Sa) 


(11.4) 


where  the  lower  limit  of  integration  is  chosen  in  accordance  with 
the  convention  that  V  (r)  vanishes  at  infinity. 

In  particular,  if  the  central  forces  are  due  to  Coulomb  inter- 
action 

r=-"-r, 

where  Z<?0  is  the  nuclear  charge  (e  =  —  ?0is  the  charge  of  an  electron 
moving  around  the  nucleus),  we  obtain  for  the  potential  energy 


r 

J7/»s  7/>a 

-£?4dr=-^-«. 


(ll.Sa) 


Now  let  us  find  the  expression 
for  the  Laplacian  y2  in  spherical 
coordinates.  Using  the  identity 


=  v  V 


(11.6) 


we  shall  first  find  the  components 
of  a  vector 


=  V 


(11-7) 


in  spherical  coordinates. 

Bearing  in  mind  that  a  gradient 
expresses  the  spatial  rate  of  a 
change  of  scalar  field  in  a  certain 

direction     (£/=  Vf  fy*=  '^r)t     we 
obtain,  in  accordance  with  Fig.  11.1, 

(11.8) 


r  sin 


^.   11.1.   Spherical   coordinates.    The  volume 
element  in  spherical  coordinates. 


Let  us  use  the  definition  of  divergence 

(BdS) 


V  •  B  =  lim 
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where  d*x  is  the  volume  element  in  spherical  coordinates 

d*x  =  r8  sin  ft  dr  d*  dy  (11.  10) 

(XL  stands  for  the  coordinates  r,  ft  and  <p),  and  dSt  stands  for  elementary  areas  perpendic- 
ular to  the  directions  dr,  rd$t  and  pflfy  respectively: 


(11.11) 
dSy  =  rdr  db. 

With  the  help  of  Eq.  (11.8),  we  obtain 

(g  ' 


<1U2) 

from   which  we  readily  find  the  expression  for  the  Laplace  operator  in  spherical  co- 
ordinates 


Setting  in  Eq.  (11.13) 

1 

r8 
and 


we  have 


so  that  the  Schrodinger  equation  (4.8)  takes  the  form 


where 

*tW  =  -^[fi-Vf(r)]  (11-18) 

is,  according  to  Eq.  (11.4),  a  function  of  the  radius  r  only. 

B.    SEPARATION  OF  VARIABLES.    EIGENFUNCTIONS 

We  shall  solve  Eq.  (11.17)  by  the  method  of  separation  of  vari- 
ables. Let  us  represent  the  desired  function  as  a  product  of  the 
radial  and  angular  parts 

cp).  (11.19) 
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/  r2  \ 
Multiplying  the  original  equation  by  l^pU  we  obtain 

_:*£.  (n.20) 


Since  the  left-hand  side  depends  only  on  r  and  the  right-hand  side 
only  on  the  angles  0  and  <p,  this  equation  can  be  satisfied  only  if 
both  the  left-  and  right-hand  sides  are  separately  equal  to  a  con- 
stant X,  called  the  separation  constant. 

We,  therefore,  obtain  the  following  equations  for  the  radial  and 
angular  parts,  respectively: 


0.  (11.22) 

The  important  point  to  note  is  that  the  angular  part  of  the  wave 
function  does  not  contain  the  variable  r  and  is  independent  of  the 
specific  form  of  the  potential  energy  V  .  Consequently  f  as  we  men- 
tioned at  the  beginning  of  this  chapter,  the  angular  solution  will  be 
valid  for  any  central  force. 

Using   the   method  of   separation  of  variables  for  the  angular 
part  alone,  we  set 


Y  =  &(*)$(<?),  (11.23) 

and  thus  obtain  the  following  equations  for  the  functions  &  and  CD: 


.  (11.25) 

Here  m2  is  the  separation  constant  and  we  have  used  the  following 
notation: 

<n-26> 

*}  =      .  (11-27) 

where  partial  derivatives  are  replaced  by  total  derivatives,  since 
each  of  the  functions  9  and  O  depends  only  on  a  single  variable. 

Thus,  we  have  obtained  three  equations—  (11.21),  (11.24)  and 
(11.25)—  for  the  energy  eigenvalues  £/  and  the  corresponding  eigen- 
f  unctions  <f,  .  The  last  equation  contains  only  a  single  parameter 
ni\  whereas  the  first  and  second  contain  two  parameters  each. 

Since  the  solution  of  one  equation  yields  the  eigenvalues  for  only 
one  parameter,  we  must  begin  the  solution  of  the  entire  problem  by 
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solving   (11.25);  then,  knowing  m2,  w    proceed  to  solve  (11.24)  and 
finally  (11.21). 

To  find  the  normalization  constant,  we  use  the  relation 

00  K  2lt 

f  ty*ycPx=  ^  R+Rr*dr  f  9*0  sin  ta»  f  $*<M<p, 
b  b  b 

which  shows  that  each  of  the  functions  can  be  normalized  separately: 

00 

J/?*/?r»dr=l,  (11.28) 

f)  =  l,  (11.29) 


l.  (11.30) 

The  particular   solution  for   the   azimuthal   function  [see  Eq. 
(11.25)]  can  be  written  in  two  ways: 

Q  =  Ceim*  (11.31) 

or 

<i>  =  A  cos  (mcp  +  <p0).  (11.32) 

Each  of  these  solutions  has  a  different  physical  interpretation. 
The  solution  (11.31)  represents  a  wave  traveling  around  the  cir- 
cumference of  a  circle  and  corresponds,  for  example,  to  uniform 
circular  motion  of  an  electron.  On  the  other  hand,  the  solution 
(11.32)  is  associated  with  standing  waves  and  corresponds,  for 
example,  to  oscillations  of  an  electron  along  an  arc.  In  order  for 
the  function  <I>  to  describe  the  motion  of  an  electron  around  the 
nucleus,  it  must  have  the  form  of  traveling  waves  (11.31).  More- 
over, since  a  solution  proportional  to  e  im*  can  be  obtained  from 
the  first  solution  by  replacing  (m)  by  (-  m),  we  can  take,  without 
any  loss  of  generality, 

([>z=Cetmf,  (11.33) 

where  the  quantity  m  assumes  both  positive  and  negative  values. 

Since  the  wave  function  must  be  unique  (see  Chapter  4,  Section 
B),  the  function  $(?)  must  be  periodic  with  a  period 


.  (11.34) 

It  follows  that 
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and  therefore  the  quantity  m,  which  is  called  the  magnetic  quantum 
number,  assumes  only  integral  values 

m  =  0,  drl,  =t2,  dt3 (11.35) 

From  the  normalization  condition  (11.30),  we  find  C=--=.    It 
is  readily  shown  by  direct  calculation  that  the  functions 

1     -*"f  (11.36) 

satisfy  the  condition  of  orthonormality 

2ie 


Since  we  now  know  the  eigenvalues  m  and  the  wave  function 
associated  with  the  azimuthal  angle  <p  ,  we  can  proceed  to  solve 
Eq.  (11.24).  Introducing  the  new  variable 

x  =  cosO  (11.37) 

and  denoting  derivatives  with  respect  to  x  by  primes,  Eq.  (11.24) 
becomes 

=0.  (11.38) 


It  can  be  seen  that  (11.38)  has  singular  points  at  *  =  -!:  1,  that  is, 
points  at  which  one  of  the  coefficients  of  B  becomes  infinite.  To 
eliminate  this  divergence,  we  shall  look  for  a  solution  9  in  the 
form 

«  =  (!  --*«)*/2M.  (11.39) 

Substituting    (11.39)    into    (11.38)   and  dividing   all   the   terms  by 
(1—  r>)'/2,  we  obtain 


u  =  Q.     (11.40) 


We  eliminate  the  singularity  in  the  last  term  by  setting 


Since  the  fundamental  equation  for    H  depends  only  on  w2,  the 
solutions  corresponding  to  these  two  values  of  s  both  satisfy  the 
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same  equation,  and,  consequently,  there  must  be  a  simple  linear 
relationship  between  them: 

9(m)  =  49(— m).  (11.41) 

It  is,  therefore,  sufficient  to  solve  Eq.  (11.40)  for 

s  =  m^0.  (11.42) 

With  the  help  of  Eq.   (11.41),  the  solution  can  be  automatically 
extended  to  the  negative  values  of  in. 

Under  the  condition  (11.42),  Eq.  (11.40)  becomes 

(1—  *«)w"  —  2x(m+l)u'  +  (\  —  m(m+l))u  =  0.          (11.43) 

Since   this    equation  has  no  singularities,  its  solution  may  be 
represented  as  a  polynomial 


Substitution  of  this  polynomial  into  Eq.  (11.43)  gives 


Collecting  the  terms  with  the  same  powers  of  x,  we  obtain 


This  yields  a  recurrence  relation 

(11.45) 


which  gives  the  relationship  between  the  coefficients  of  the  series 
(11.44).  Since  the  coefficients  ak^  are  expressed  in  terms  of  ak  and 
thus  only  alternate  terms  are  related,  the  function  u  will  be  either 
even  or  odd  depending  on  whether  the  main  term  is  even  or  odd. 

We  require  that  the  series  (11.44)  terminate  at  some  maximum 
power  k  =  q,  so  that 


Then  from  (11.45)  we  obtain 

+l),  (11.46) 


where 

(7  =  0,  1,  2,  3,...  (11.47) 
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(that  is,  q  is  equal  to  the  power  at  which  the  series  is  terminated). 
Introducing    the    orbital    angular  momentum  quantum  number   I 

I  =  q  +  m9  (11.48) 

we    find    that,  just  like  the  numbers    q  and  m,  this  number  can 
assume   only   positive   integral   values   (including   zero),   that  is, 

J  =  0,  1,  2,  3,  ...  ,  (11.49) 

and  from  (11.48)  it  follows  that 

l^m.  (11.50) 

According  to  (11.48)  and  (11.46),  we  have 

X  =  /(/  +  l),  (11.51) 

and,  therefore,  Eq.  (11.40)  can  be  reduced  to  the  form 

(1—  ^)a"  —  2jt(m+l)af  +  [/(/+!)  —  m(m+  I)]  £i  =  0,     (11.52) 
where 

.  (11.53) 


Instead  of  determining  the  relationship  between  the  coefficients  aq 
and  aqYl  by  means  of  the  recurrence  relation  (11.45),  let  us  represent 
the  solution  (11.53)  in  a  closed  form.  For  this,  we  introduce  the 
function 

i;  =  (**-!)',  (11.54) 

satisfying  the  equation 

(1—  *V  +  2*fo  =  0,  (11.55) 


which  is  easily  obtained  by  taking  the  first  derivative  of  v  with 
respect  to  x.  Differentiating  Eq.  (11.55)  with  the  help  of  Leibnitz's 
rule  [see  (10.34a)]  and  setting 


we  obtain  the  following  equation  for  the  function  u^ 

-m)  ^=0.      (11.57) 
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We  note  that  this  equation  is  exactly  the  same  as  the  differential 
equation  (11.52)  for  the  function  u.  Consequently,  functions  u  and 
i/i  must  be  proportional  to  each  other 

w=const  •  MI.  (11.58) 

Since  the  normalization  coefficient  of  the  function  6  has  not  yet 
been  determined,  we  can  set  this  proportionality  constant  equal 

to  2//I  in  order  to  make  the  solution  (11.58)  f or  m  =  0  identical  with 
the  Legendre  polynomial 


We  thus  obtain 


from   which,   with   the   help   of   Eq.    (11.39),  we  find  the  following 
expression  for  the  function  ft: 


Here  P?    is   an  associated   Legendre  polynomial   defined  by  the 
equation 


andC/m  is  the  normalization  coefficient. 

Although  (11.61)  was  obtained  for  positive  values  of  m,  it 
can  also  be  extended  to  include  the  negative  values  of  m  by  using 
the  well-known  relation1 

-  W.  <n-62> 


*To  prove  Eq.  (11  62),  we  put  it  in  the  following  form  with  the  help  of  (11.61): 
2  '  d     '"'       21 


dx 


+]ml  dx'~{ml 


(1163) 


Since  P/m  and  P/"1"  must  be  linearly  related  to  each  other  [see  (11  41)1,  it  is  sufficient  for 
us  to  show  that  the  coefficients  of  the  leading  power  of  x  on  both  sides  of  Eq.  (11.63)  are 
equal  to  each  other,  that  is, 


This  is  easily  shown  shown  since 


fof 


__ 

"TiT  -    }  ("-*)'  (11  64) 

dX  0  for         k  >  n  . 
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From  (11.61)  and  (11.62),  it  follows  that  the  range  of  variation  of 
the  quantum  number  m  is 


m  =  0,  ±1,  dr2,...,±/; 

since  for  |m|>  /  the  solution  Pzm  vanishes.2 

The  coefficient  C/m  in  (11.60)canbefoundfrom  the  normalization 
condition 

•K  \ 

J  er  «,m  sin  »  d»  =  J  er  (*)  e,m  (*)  d*  =  !  • 

Substituting  the  solution  (11.60)  and  using  (11.62),  we  obtain 


Transferring  the  derivative  from  the  second  factor  in  the  integrand 
to  the  first  factor  /-j-m  times  (that  is,  expanding  the  integral  by 
parts  /-{-m  times),  we  obtain 


Using  the  equation  [see  also  (11.64)] 


and 


f  \-*'dx= 

J        x 


we  find 


2Because  of  the  linear  relationship  between  P/m   and  P/  m,   many  authors  present  the 
solution  for  the  function  6  in  the  form 

e  -  cpjm'<*). 

We  shall  not  use  this  form,  since  in  this  case  the  recurrence  relation  between  the  asso- 
ciated Legendre  polynomials  is  more  complicated  than  for  the  solution  (11  60)  (the  recur- 
rence relation  is  important  in  connection  with  the  selection  rules  and  the  solution  of  the 
Dirac  equation). 


176  NONRELATIVISTIC    QUANTUM    MECHANICS 

Then 

ni  66) 
{ii.oo  ; 


For  the  spherical  harmonic  KF(d,  9),  which  satisfies  Eq.  (1.22), 
the  relations  (11.23),  (11.36)  and  (11.66)  yield 


The  orthonormality  condition  for  the  spherical  harmonics  takes 
the  form3 

?'Y  YTdQ  =  ««,  8mm,  (11.68) 

After  having  obtained  the  eigenvalues  of  the  parameters  m  and 
X,  we  may  proceed  to  the  solution  of  Eq.  (11.21)  for  the  radial  part 
in  which  there  remains  only  one  unknown  parameter.  The  radial 
solution,  however,  can  be  obtained  only  if  the  form  of  the  potential 
energy  V  (r)is  specified,  and,  therefore,  we  shall  leave  this  question 
aside  until  we  come  to  consider  various  specific  forms  of  V  (r)  in 
the  following  chapters. 


C.    PHYSICAL  MEANING  OF  THE  QUANTUM  NUMBERS 
/  AND  m.  ANGULAR  MOMENTUM 

We  have  found  that  the  quantum  number  /  characterizes  the 
eigenvalue  X  =  /(/-f-  l)of  the  operator  -yi,?  [see  (11.  22)  and  (11.51)], 
which  is  a  part  of  the  Hamiltonian 


Comparing  this  Hamiltonian  with  the  classical  Hamiltonian  function 

V  (r),  (11.70) 


To  prove  the  orthonormality  condition  (11  68),  we  substitute  the  expression  (11.67)  for 
the  spherical  harmonics  into  (11  68).  Integrating  over  the  angle  <p,  we  can  readily  show  that 


2TT 


Integrating  the  Legendre  polynomials  over  the  angled,  we  can  set  m'—  m.  Then  without 
loss  of  generality,  we  can  take  /' ^ /.  The  case  I'-  I  was  considered  above  in  connec- 
tion with  the  determination  of  the  normalization  coefficient.  In  similar  fashion,  it  can  be 
readily  shown  that  for  l'<l  the  integral  (11  68)  vanishes  as  a  result  of  transferring  the 
derivative  from  the  function  with  subscript  /  to  the  function  with  subscript  /  . 
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where  pr  =  mj,  and  L  =  mQr^  we  see  that  the  operator  (— 
corresponds  to  the  square  of  the  angular  momentum  ta  in  the 
classical  case,  and  the  operator  (—  AV)  to  the  square  of  the  radial 
momentum  p?. 

Let   us  investigate  these  analogs  in  more  detail.   As  we  know 
from  classical  mechanics,  the  angular  momentum  L  is  defined  as 

L=  rxp.  (11.71) 


If  external  forces    F  exert  a  torque  M  =  r  x  F,  the  time  rate  of 
change  of  L  is  given  by 


In  the  case  of  central  forces  (F\\r)9  no  torque  M  is  exerted  and, 
consequently, 

L  =  const. 

In  classical  mechanics  this  result  is  known  as  the  law  of  conserva- 
tion of  angular  momentum;  it  appears  in  Kepler's  theory  of  plane- 
tary motion  as  the  law  of  conservation  of  areal  velocities  (that  is, 
the  law  of  areas). 

To  generalize  the  classical  expression  for  the  angular  momen- 
tum to  the  quantum  case,  we  replace  the  classical  momentum  p  in 

(11.71)  by  the  momentum  operator  p  =  y  y-  We  then  obtain 

L  =  rxp=yrxv  (11.73) 

or 

Lx=ypz  —  zpy, 

L^zpt  —  xp,,      &  (11.74) 


We  note  that  the  angular  momentum  operators  Lx,  Ly  and  Lz  do  not 
commute  with  each  other.  For  example,  by  direct  calculation  of 
the  commutation  relation  between  L^andLy,  we  find 

LxLy  —  LyLx  =  (yp,  —  zpy)  (zpx  —  xpz)  —  (zpx  —  xpg)  (ypz  —  zpy). 

Using  the  commutation  relation  between  the  momenta  and  the 
corresponding  coordinates  [see  (7.7)  and  (7.8)],  we  find 

LxLy  -  L,L,  =  —  M  (ypx  -  xpy)  =  ML,.  (11.75) 

Similarly,  it  can  be  shown  that 
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To  express  the  square  of  the  angular  momentum  operator 

(11.77) 


in  spherical  coordinates,  we  must  first  determine  the  components 
Lx,  Ly  and  Lz  in  these  coordinates.  Using  the  relations  (11.2)  between 
Cartesian  and  spherical  coordinates,  we  have 

dty  _  <ty  dr    ,    dty  dy    ,    dfy  dz  _ 
"W       dx  "dS  ~i~  ~dy  W  *   ~dz  W~~ 

=  r  cos  0  cos  cp  0|  -f-  /-  cos  d  sin  cp  ^  —  r  sin  0  ^-  = 
_  xz  d^  j^  .yz  (?|»  dfy 

—  ~r~dx~r~Y~ty  ~~*~fc  '  (11.78) 

d^  _  d<\>  dx_\<fy_  dy    ,    dty_  dz  _ 
<         ~x  ~d<     '    d  •"          < 


^g-.         (11.79) 

Multiplying  Eq.   (11.78)  by  -  and  Eq.  (11.79)  by(—^),  adding  these 
products,  and  remembering  that  p*  =  x*-}-y*t  we  obtain  the  relation 

(11.80) 


Now  let  us  multiply  Eq.   (11.78)  by   —£  and  Eq.  (11.79)  by  —   r. 
Then,  proceeding  in  the  same  way  as  before,  we  obtain 


Using  Eqs.  (11.79)  and  Eqs.  (11.74),  we  find 


(11.82) 
(11.83) 


Introducing  the  variable  jc  =  cos  d  (which  should  not  be  confused  with 
the  Cartesian  coordinate  *),  Eqs.  (11.82)  and  (11.83)  can  be  written 
in  the  form 


dl.85) 
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To  determine  the  effect  of  these  operators  on  the  spherical 
harmonics,  let  us  take  advantage  of  the  fact  that  a  particular 
spherical  harmonic  can  be  represented  either  as  (11.67)  or  as 


Operating  directly  with  L2  on  the  spherical  harmonic,  we  find 

(11.87) 


From  this  it  follows  that  the  quantum  number  m  characterizes  the 
z  component  of  the  angular  momentum. 

,  To  determine  the  effect  of  the  operator  Lx-\-iLy  on  the  spherical 
harmonic  \f  ,  we  use  the  expression  (11.67),  and  for  the  effect  of 
the  operator  Lx  —  iLv  we  use  the  equivalent  expression  (11.86).  Then 
from  the  equation 


it  follows  that 

(L,  it  iLy)  YT  =  —  H  V(l  +  1  ±  m)  (/  ip  m)  YT±l-  (11.88) 

Equations  (11.87)  and  (11.88)  yield 


X  (L,  +  /  Ly)  +  Lj]  YT  =  -  ft'yl.  f  *T  = 

--^/(/.j-  i)  yj*.  (11.88a) 

The  last  equation  shows  that  YT  is  an  eigenfunction  of  the  operators 
L2  and  LA  This  follows  from  the  fact  that  the  operators  L,  and  La 
commute  not  only  with  each  other,  but  also  with  the  Hamiltonian  H. 
Since  the  operators  Lx  and  Lv  do  not  commute  with  L,f  it  is  impos- 
sible to  find  a  wave  function  that  would  be  a  simultaneous  eigen- 
function of  the  operator  Lz  and  the  operators  Lx  or  Ly.  This  does 
not  mean,  however,  that  the  direction  of  the  z  axis  is  a  preferred 
direction,  since  it  can  be  chosen  arbitrarily. 

The  spherical  harmonic  can  be  written  in  such  a  manner  that  it 
will  be  an  eigenfunction  of  the  operators  L^.  and  L4  In  this  case, 
it  will  no  longer  be  an  eigenfunction  of  the  operator  Lz  (see  Prob- 
lem 12.2). 
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D.    ANALYSIS  OF  THE  RESULTS 

Quantum-mechanical  results  are  generally  analyzed  either  by 
finding  their  classical  analogs  or  by  comparison  with  the  results 
of  Bohr's  semi-  classical  theory,  which  has  a  simple  physical 
interpretation.  To  apply  Bohr's  theory  to  the  motion  of  a  central 
field,  we  start  from  the  classical  law  of  conservation  of  angular 
momentum;  we  then  conclude  that  the  motion  takes  place  in  a  single 
plane  and  that  the  angular  momentum  vector  (which  is  perpendicular 
to  this  plane)  has  the  magnitude 

L  =  p9=g  =  m0/^  =  const.  (11.89) 

Applying  the  quantization  rules,  we  find  the  discrete  values  that  the 
angular  momentum  can  assume 


Hence 

L^=n*H\  (11.90) 

where 

/if=l,  2,  3  .....  (11.91) 

If  the  z  axis  is  not  perpendicular  to  the  plane  of  the  orbit,  then 
the  Bohr  theory  allows  us  to  quantize  the  projection  of  the  angular 
momentum  vector  on  the  z  axis.  Tnis  is  known  as  space  quantiza- 
tion. Then 


where 

n^  =  —  nvt   —/if+l,  ....  0  .....  /if  —  1,  nr  (11.93) 

It  follows  from  this  that  the  angle  a  between  the  direction  of  the 
angular  momentum  L  and  the  z  axis  is  given  by  the  equation 

cosa  =  ^;  (11.94) 

nv 

that  is,  it  can  assume  only  certain  discrete  values. 

Space  quantization  is  illustrated  in  Fig.  11.2,  which  shows  that 
^  =  /l9  corresponds  to  the  case  when  L  is  parallel  to  the  z  axis 
(Fig.  11.2a),  where  \=  —  n^  corresponds  to  the  case  of  anti- 
parallel  L  (Fig.  11.2b).  Finally,  for  /i^  =  0fthe  vectors  are  mutually 
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perpendicular  (Fig.  11. 2c).  It  is  obvious  that  space  quantization  has 
a  meaning  only  when  there  is  some  preferred  direction  in  space,  for 
example,  the  direction  of  the  magnetic  field  intensity  vector.4  If 
there  is  no  preferred  direction,  the  orientation  of  the  z  axis  may 
be  taken  as  perpendicular  to  the  plane  of  the  orbit. 


Fig.   11.2.     Space    quantization    according    to    the    Bohr 
theory  (for  L  -  1;  in  units  of  h) 

Now  let  us  compare  the  quantum-mechanical  results  with  those 
of  the  Bohr  theory  for  the  square  and  the  -z  components  of  the 
angular  momenta: 


/i=  l,  2,  3,  4,  ... 


It  is  seen  that  L'qm  is  zero  when  /  =  0,  whereas  Lfa  can  never  be 
zero.  This  means  t.h^  thp.  gfato  ^yjth  /  =  Q  has  no  classical  analog. 
It  follows*  that  the  angular  'momentum  ot'*ftn  atom  in  tke  lowest 
state  is  "equal,  to  zero,  contrary  to  me  results  ol:  me  soiir  theory. 
T'Ee  fiSfpBtimentai  aata  from  atomic  spectroscopy  fully  confirm 
this  quantum- mechanical  result. 


It   is,    of  course,   understood   that   in   the   presence   of  a   magnetic   field  the   central 
symmetry  is  disturbed. 
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In  the  Bohr  theory  the  direction  of  the  z  axis  can  be  taken  to 
coincide  with  that  of  the  orbital  angular  momentum.  In  this  case 
n  ==/zv  and,  therefore, 


In  the  wave  theory  this  case  corresponds  to  m  =  /,  when 

^max=fr2/2'  <n-96> 

whereas 


The  appearance  of  the  additional  orbital  angular  momentum  tiH  is 
related  to  the  noncommutativity  of  the  angular  momentum  opera- 
tors  Lxt  Ly  ,  and  Lz  ,  as  a  result  of  which  the  angular  momentum 
components  cannot  have  simultaneous  definite  values.  Therefore, 
when  £*==&zmax=^»  the  components  Lx  and  Ly  do  not  vanish  but 
have  certain  minimum  values  satisfying  the  relation 

^m=  Vmax+  (^X)min+  (^)min.  (11.98) 

The  minimum  value  of  (AZ^)*  and  (ALV)'2  may  be  obtained  with  the 
help  of  the  uncertainty  principle  [see  (8.13)]: 

(ALXi^)Ur=  1  1  LxLy  -  LyLx\*  =  1  tfl*  max  =  }  *V.    (11.99) 

Since  the  problem  is  symmetric  with  respect  to  the  x  and  y  axes, 
we  may  set  (ALx)2min=  (A^)3min.  Hence,  we  obtain 

(^)Jnin=(A^fein=*iT'  (11.100) 

and  the  sum  of  (AL^)^in  and  (ALv)imln  is  exactly  equal  to  the  addi- 
tional angular  momentum  H*l.  As  a  result  we  arrive  at  Eq.  (11.97). 
Thus  the  nature  of  this  additional  term  is  the  same  as  that  of  the 
zero-point  energy  of  the  harmonic  oscillator.  Both  are  related 
to  the  uncertainty  principle.  For  large  values  of  the  orbital  angular 
momentum  quantum  number/,  we  can  neglect  the  term  H*l  in  (11.96) 
in  comparison  with  /W%  so  that  in  fact  we  have  the  Bohr  semi- 
classical  solution. 


Chapter  12 

The  Rotator 


Spherical  harmonics,  which  are  the  eigenfunctions  of  the  square 
of  the  angular  momentum,  have  their  main  application  in  the 
quantum  theory  of  the  rotator,  that  is,  in  the  quantum-mechanical 
description  of  the  free  motion  of  a  point  over  a  sphere.  The 
results  of  the  theory  of  the  rotator  can  be  used  directly  in  connec- 
tion  with  the  spectra  of  diatomic  molecules.  Since,  however,  the 
angular  part  of  the  wave  function  in  a  central  field  is  also  described 
by  spherical  harmonics,  many  predictions  from  the  theory  of  the 
rotator  (for  instance,  the  angular  dependence  of  the  wave  function 
<1>  and  the  selection  rules  for  the  quantum  numbers  /  and  m)  remain 
unchanged  in  the  theory  of  a  particle  in  a  central  field  (for  example, 
a  particle  in  a  Coulomb  field  in  the  problem  of  the  hydrogen  atom). 


A.    EIGENFUNCTIONS  OF  THE  ROTATOR 

We  shall  first  write  the  basic  results  of  the  quantization  of  the 
rotator  according  to  Bohr's  theory.  These  will  be  used  as  a 
starting  point  in  our  further  discussion. 

Suppose  a  point  is  moving  over  a  sphere  of  radius  r  =  a  =  const. 
Let  the  origin  of  the  coordinate  system  be  at  the  center  of  the 
sphere.  The  potential  energy  V  (r)  is  then 


Since  the  reference  level  of  the  potential  energy  can  be  specified 
in  any  desired  manner  by  defining  its  value  at  some  point  as  zero, 
we  set 


The  total  energy  of  the  rotator  is  then  equal  to  its  kinetic  energy 

E=T  =  ^f-.  (12.2) 

The  generalized  momentum  p^  which  here  has  the  meaning  of  the 
angular  momentum,  is  found  to  be 
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Using  Bohr's  quantization  rule,  we  obtain 

Pf  =  /!,*,  (12.4) 

and,  consequently, 


where  J  =  m()a*  is  the  moment  of  inertia. 

The  quantum-  mechanical  theory  of  the  rotator  is  a  special 
case  of  the  problem  of  motion  of  a  point  in  a  central  field  of 
force.  Consequently,  we  shall  use  Eq.  (1.21)  to  determine  the 
radial  function  R(r): 

0.  (12.6) 


We  have  here  set  the  potential  energy  equal  to  zero  and  substituted 
^  =  /(/-f-l)  in  accordance  with  Eq.  (11.51).  Since  r  =  a  =  const  for 
the  rotator,  function  /?(r)  =  /?(a)  =  const,  that  is,  v?#(a)  =  °- 
energy  Et  is  now  easily  found  from  Eq.  (11.51) 


*'*(/+!) 
27 


7) 

*    ' 


Comparing  this  equation  with  Eq.  (12.5),  we  see  that  in  the  Bohr 
theory  E'^n*,  whereas  in  quantum  mechanics  £~  /(/+!)•  As  has 
already  been  mentioned  in  Chapter  11  (Section  D),  this  difference  is 
due  to  the  noncommutativity  of  the  components  of  Lxt  Ly  and  Lz  of 
the  angular  momentum  operator;  it  is  one  of  the  characteristic 
features  of  quantum  mechanics.  Both  equations  become  identical 
only  for  large  quantum  numbers,  that  is,  when  /*>/. 

According  to  Eq.  (12.7),  the  energy  of  the  rotator  depends 
only  on  the  orbital  angular  momentum  quantum  number  /.  The 
magnetic  quantum  number  m9  which  characterizes  the  projection 
of  the  angular  momentum  L  on  the  z  axis  (and,  consequently,  the 
orientation  of  the  angular  momentum  in  space),  does  not  appear  in 
the  expression  for  £,.  The  eigenfunctions  Yf  corresponding  to  the 
eigenvalue  Et  [see  (11.67)]  do,  however,  depend  on  m.  Since  m  can 
vary  from  —  /  to  +/  [see  (11.50)],  each  energy  eigenvalue  Et  will 
have  2/+  1  corresponding  mutually  orthogonal  eigenfunctions;  they 
describe  the  state  of  the  rotator  and  differ  only  in  the  orientation 
of  their  angular  momentum  L  relative  to  the  z  axis.  In  this  case 
the  energy  level  E7is  said  to  be  (2/+  l)-fold  degenerate. 

In  general,  a  state  of  a  system  (or  a  given  level)  is  said  to  be 
N-  fold  degenerate  if  Af  linearly  independent  eigenfunctions  corre- 
spond to  the  given  energy  eigenvalue, 
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Phv*l6ftl  emanation  for  thfi  faSW*™^  ^.tlp  merfg  levels 
of  the  rotator  is  tKat  ine  rotator  forms  a  centrally  symmetric 
system,  and  consequently  all  directions  passing  theough  the  origin 
of  coordinates  are  equivalent.  From  these  considerations  it  fol- 
lows that  degeneracy  will  occur  in  any  centrally  symmetric 
system. 

If  there  exists  a  preferred  direction,  for  example,  one  deter- 
mined by  the  direction  of  a  magnetic  field,  the  central  symmetry  is 
disturbed  and  the  possible  directions  of  the  angular  momentum  L 
are  no  longer  equivalent.  As  a  result,  the  degree  of  degeneracy  is 
either  reduced,  or  the  degeneracy  can  be  completely  removed. 

In  spectroscopic  notation  the  energy  levels  are  called  terms; 
for  example,  the  level  corresponding  to  /  =  0  is  called  the  s  term 
and  the  level  corresponding  to  1=  1  ,  the  p  term.  For  the  d  term, 
/  =2;  for  the  /  term,  1  =  3;  for  the  g  term,  1  =  4;  and  so  on. 
Correspondingly,  the  rotator  is  said  to  be  in  the  s  state  when  /  =  0» 
in  the  p  state  when  /  =  1  ,  and  so  on. 

Let  us  consider  in  more  detail  the  5  and  p  states  of  the  rotator. 
Since  l  =  m  =  Q,  in  the  5  state,  it  follows  from  Eq.  (11.67)  that  the 
eigenfunction  7J  corresponding  to  the  energy  eigenvalue  £0  =  0  is 


and  the  probability  density  |  KJ|*  is  given  by 


In  the  p  state  /  =  1,  and  the  quantum  number  m  can  have  any  of 
the  three  values   —1,  0  and  -pi.    Consequently,  the  energy  eigen- 

value £1  =  -j-  is  associated  with  the  three  eigenfunctions: 


?  =  -  1"  shift,  (12-10> 

f       on 

Y\  =  y~±  cos*.  (12.11) 

(12.12) 


The  corresponding  probability  densities  are 


(12.13) 

~cos'&.  (12.14) 
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The  probability  distribution  functions  (12.9),  (12.13)  and  (12.14) 
are  plotted  in  Fig.  12.1;  they  are  shown  only  in  the  zy  plane  because 
|  V™  |  does  not  depend  on  the  angle  ft.  To  obtain  a  complete  picture, 
it  is  necessary  to  rotate  the  graph  about  the  z  axis. 


Fig.   12.1.    Probability   density    distribution   functions    for   the 
rotator. 

It  can  be  seen  from  Eq.  (12.9)  and  Fig.  12.1a  that,  for  a  rotator 
in  the  s  state,  the  angle  <p ,  which  gives  the  direction  of  the  angular 
momentum  L  relative  to  the  z  axis,  is  arbitrary.  This  was  to  be 
expected  since  the  angular  momentum  L*  =  #*/(/-{-  1)  is  equal  to  zero 
in  this  case.  A  material  particle  at  rest  has  an  equal  probability 
of  being  at  any  point  on  the  spherical  surface  of  radius  a.  In  other 
words,  all  positions  of  the  rotator  are  possible.  There  is  no  clas- 
sical analog  of  this  state. 

From  Eq.  (12.13)  and  Fig.  12. Ib  it  follows  that  in  the  p  state, 
when  /=  1  and  m  =  dL  1  ,  the  most  probable  of  all  the  possible  orbits 
of  the  rotator  is  the  one  located  in  the  xy  plane.  The  states  with 
m=l  and  m=  —  1  will  have  different  directions  of  rotation:  for 
m=\  the  rotator  will  rotate  clockwise  (the  angular  momentum  L  is 
parallel  to  the  z  axis);  for  /n  =  —  1  it  will  rotate  counterclockwise 
(the  angular  momentum  L  is  antiparallel  to  the  z  axis).  When 
1  =  1  and  m  =  0,  the  most  probable  orbit  of  the  rotator  lies  in  a 
plane  passing  through  the  z  axis  [see  Eq.  (12.14)  and  Fig.  12.1c]. 
In  this  case,  the  orientation  of  the  angular  momentum  is  perpen- 
dicular to  the  z  axis. 

It  is  worth  mentioning  that  a  similar  analysis  of  the  angular 
part  of  the  wave  function  applies  to  all  systems  characterized  by 
central  symmetry. 
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B.    SELECTION  RULES 

As  has  already  been  shown,  the  selection  rules  indicating  the 
changes  of  the  quantum  numbers  that  correspond  to  allowed  transi- 
tions can  be  expressed  in  terms  of  the  matrix  elements 

(r$£'=  j',(V7')*rV7^.  (12.15) 

If  the  matrix  element  vanishes  for  some  particular  chapgff  oj:  foe 
quantum  .numbers,  the  corresponding  transitioiTis  f^rb^d^en  (there 
wiin>e  no  ~  radiation).  6nce~  we  know  the  selection  rules,  we  can 
immediately  find  both  the  frequency  and  the  intensity  of  the  radia- 
tion [see  Eq.  (9.22)], 

Let  us  replace  the  coordinates  x,  y  and  z  (that  is,  replace  r)  in 
Eq.  (12.15)  by  the  following  new  variables: 

(12.16) 
(12.17) 
(12.18) 

From  the  physical  point  of  view,  this  is  equivalent  to  resolving 
the  motion  of  the  rotator  into  three  parts—  an  oscillation  along  the 
z  axis  described  by  the  z  component;  a  clockwise  rotation  in  the 
xy  plane  (the  £  component)  ;  and  a  counterclockwise  rotation  in  the 
xy  plane  (the  TI  component).  In  combination,  these  three  components 
completely  describe  the  motion  of  a  point  over  the  surface  of  a 
sphere. 

In  terms  of  the  new  variables  the  determination  of  the  selection 
rules  reduces  to  a  calculation  of  the  matrix  elements: 


;*'  =     (Yf)*  sin  M*  Y?  dQ,  (12.20) 


($£'  =  (   (Y*1)*  sin  Oe  "  Yf  dQ9  (12.21) 

where,  for  the  sake  of  simplicity,  we  have  set  a=  1. 
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Using  the  recurrence  relations  for  spherical  harmonics1 

(12.22) 
(12.23) 
together  with  the  orthonormality  condition  (11.68),  we  find 

(z)/m '  =  const  lm,.  m»r.  /± , ,  (12.24) 

(tfim  =  const  8m',  m+i*rt  /±1 ,  (12.25) 

(*i)/m '  =  const  &m.(  m«  i&/i,  i±i.  (12.26) 

Therefore,  we  obtain  the  following  selection  rules: 

(a)  for  vibration  along  the  z  axis 

Am  =  m  —  m'  =  0,     A/  =  /~~/'  =  zt:  1;  (12.27) 

(b)  for  clockwise  rotation 

Am  =  -l,     A/  =  ±l;  (12.28) 

(c)  for  counterclockwise  rotation 

Am  =  +  1 ,     A/  =  it  1 .  (12.29) 

We  have  just  shown  that  the  only  allowed  transitions  are  those 
for  which  the  changes  of  the  magnetic  quantum  number  tn  and  the 
orbital  quantum  number  /  are 

(12.30) 
(12.31) 


The   coefficients   A   and  B   can  be   found   in    a   fairly    simple  way     We   substitute   the 
expansion  (11  67)  into  Eq.  (12  22),  setting 


Then,    dividing  all  the  terms  by  etm<f  (I-*2)™72  and  equating  the  coefficients  of  xl~ 
and  the  coefficients  of  X  ~m~     on  the  left-  and  right-hand  sides  (nothing  further  is  obtai 
by  equating  the  coefficients  of  the  remaining  powers  of  x),  we  find 


Similarly,  we  find 

-    ± 

?       (2/+l)(2/  +  3) 

(12  23a) 

B±(/,m)   =  '"'  " 
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We  note  that  these  selection  rules  for  the  quantum  numbers  m  and 
/  will  hold  for  any  centrally  symmetric  system  including,  in 
particular,  the  hydrogen  atom. 

From   the   selection  rules,  we  can  find  the  possible  emission 
(or  absorption)  frequencies  of  the  rotator: 


Substituting  the  expression  for  the  energy  £,  [see  Eq.  (12.7)] 
and  considering  that  the  moment  of  inertia  of  the  rotator  does  not 
change  in  this  case,  we  can  reduce  Eq.  (12.32)  to  the  form 

<%<  =  -A  [/  (/  +  l  )  _  /'  (''  +  1  )]  •  (12.33) 


- 
From  Eqs.  (12.31)  and  (12.33),  we  obtain 


<12«34> 


tt|i ,+  ,  =_.*(/ -I- l)f  (12.35) 

where  the  frequency  co,  ,_j  corresponds  to  a  transition  from  a  higher 
energy  level  to  a  lower  one  (a  downward  transition)  and  w/,/+i  to  an 
upward  transition. 

C.    SPECTRA   OF  DIATOMIC   MOLECULES 

There  are  three  main  types  of  spectra— the  continuous  spectrum 
of  radiation  emitted  by  a  heated  body  (for  example,  black-body 
radiation,  with  a  spectral  distribution  described  by  Planck's 
formula);  line  spectra  (or  atomic 
spectra),  caused  by  transitions  of 
atomic  electrons  between  energy 
levels  (for  example,  the  Balmer 
series  for  the  hydrogen  atom);  and 
finally,  band  spectra  character- 
izing molecular  radiation.  A  band 
spectrum  consists  of  bright  bands 
with  a  sharp  edge  on  the  low-fre- 
quency side  and  a  diffuse  boundary  Fig.  12.2.  Diagram  of  a  diatomic 
on  the  high-frequency  side.  Only  molecule, 

a  high- re  solution  spectrograph  can 

show  that  the  band  actually  consists  of  a  series  of  individual 
lines. 

As  we   shall   see  below,  band  spectra  are  directly  related  to 
the  rotational  motion  of  molecules. 

Let  us  consider  a  molecule  consisting  of  two  atoms  with  masses 
mi  and  m2  separated  by  a  constant  distance  r  (see  Fig.  12.2).  An 


•O 
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example  of  such  a  molecule  is  provided,  in  a  first  approximation, 
by  the  diatomic  HC1  molecule.  It  is  well  known  that  in  the  case  of 
two  or  more  particles,  the  center  of  mass  moves  as  a  single  particle 
whose  mass  is  equal  to  the  sum  of  the  masses  of  all  the  particles: 

™Sum=I>«.  (12.36) 

The  relative  motion  of  the  particles  is  characterized  by  the  reduced 
mass  ///red*  the  reciprocal  of  which  is  equal  to  the  sum  of  the 
reciprocals  of  the  masses  of  all  the  particles: 


(12-37) 


mred 


To  prove  this,  let  us  write  the  Lagrangian  X  of  a  system  consisting, 
for  example,  of  two  mutually  interacting  particles  with  masses  mi 
and  tn.i 

X  =  -^  +  -mi-  -  V  (x,  -  *,),  (12.38) 


where  *,  and  *2  are  the  coordinates  of  the  first  and  second  particles, 
respectively,  and  x^  —  xt  is  the  distance  between  them.  Introducing 
the  relative  coordinate  x  =  *2  —  Xj  and  the  coordinate  of  the  center 
of  mass 


c.m          m 
we  obtain 


—  m  sum  •frc.ny  i  m  recj  ?fl  _  y  .. 


from  which,  using  the  Lagrange  equation 

A 
dt 

we  find  that  the  motion  of  the  center  of  mass  is  characterized  by 
the  total  mass 

msum-*c.m.=  const,  (12.40) 

and  the  relative  motion  of  a  particle  by  the  reduced  mass 

—  as- 
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If  the  center  of  mass  is  at  rest  (^.mf=0).  the  coordinates  *,  and 
are  related  to  the  relative  coordinate  x  by  the  equations 


MI  -f-  /Wa '  //*i  "f"  /#3* 

Accordingly,  the  moment  of  inertia  of  a  diatomic  molecule  is 

J  =  m{x\  -f-  m4*S  =  flVpri**  (12.42) 


which  is  similar  to  the  expression  for  the  moment  of  inertia  of  a 
single  material  particle  whose  mass  is  equal  to  the  reduced  mass 
and  whose  coordinate  is  equal  to  the  relative  coordinate.  Therefore, 
in  all  the  results  obtained  for  the  rotator,  we  must  substitute  Eq. 
(12.42)  for  the  moment  of  inertia,  setting  *  =  u. 

If  the   radiation   is   caused  only   by   rotational   transitions,  it 
follows  from  Eq.  (12.34)  that  its  frequency  is 

<oli/_1  =  2£/f  (12.43) 

where 

fl       »~      *,  .  (12.44) 


From  this  it  is  seen  that  rotational  spectra  (molecular  spectra 
that  result  from  transitions  between  rotational  levels)  consist  of 
sets  of  equally  spaced  lines  (see 
Fig.  12.3).  The  rotational  spectrum,  1*4 
however,  lies  in  the  far  infrared 
region  (radiation  wavelengths  of  the 
order  of  100-300  /i)  and  its  investi- 
gation is  rather  difficult.  Absorption 
lines  of  this  type  have  been  discov- 
ered, for  example,  in  the  spectra  of  2 
HC1  molecules.  A  measurement  of 
the  spacing  between  the  lines  enables  / 
us  to  determinethe  moment  of  inertia  0 
of  the  molecule. 


In  addition  to  pure  rotational  spec- |      |      | 


tra,  there  are  vibrational- rotational  d)lo  v21  uj32  w43 

spectra,    which    result  from  the  in- 
ternal   Vibrations    Of    a    molecule    in  Fig.  12.3.    Spectrum    of   a    rotator. 

conjunction  with  its  rotation.  These 

spectra   are   not   situated  as  far  in  the  infrared  as  the  rotational 

spectra  and  are  more  easily  studied. 

Let  us  consider  in  general  form  the  theory  of  a  diatomic  mole- 
cule with  varying  interatomic  distance.  This  diatomic  molecule 
represents  an  oscillating  rotator.  Without  considering  in  detail 
the  atomic  interactions  in  the  molecule,  and  using  only  simple 
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qualitative  arguments,  let  us  find  the  general  form  of  the  potential 
energy  curve  V  (r) . 

First,  we  must  set  K(r-*0)-*oo,  since  the  atoms  cannot  be 
arbitrarily  close  to  one  another.  Second,  the  interaction  between 
the  atoms  must  become  negligibly  small  as  r->oo,  and  hence 
y  (r  _^  co)  ->  o.  Moreover,  since  a  molecule  is  a  stable  system,  the 
potential  energy  V  must  be  negative  at  a  certain  finite  interatomic 
distance  r  =  a,  where  it  has  a  minimum.  Otherwise,  if  K^=0  the 
molecule  would  rapidly  dissociate  (see  Chapter  27  for  a  more 
detailed  account  of  the  bonding  energy  of  atoms  in  molecules).  The 
general  form  of  the  dependence  of  the  potential  energy  of  atoms  in 
a  molecule  on  the  interatomic  distance  is  illustrated  in  Fig.  12.4. 


Fig.  12.4.  Potential  energy  diagram  for  a  diatomic 
molecule 

If  the  departures  x  —  r  —  a  of  the  molecule  from  the  equilibrium 
position  (r  =  a)  are  relatively  small  (x<a),  the  potential  energy 
V  (r)  can  be  expanded  in  a  series  in  the  neighborhood  of  the  equi- 
librium point  r  =  a 

V(r)  =  V  (a  -|- x)  =  V  (a)  4- xV  (a)  +  f  V (a)  +  (12.45) 

Keeping  only  the  first  three  terms  of  this  expansion  and  noting 
that  the  function   V  has  a  minimum  at  the  point  r  =  a  [V (a)  =  0  and 
],  Eq.  (12.45)  can  be  reduced  to  the  form 


(12.46) 
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Here  V^^m^^  and  K(a)  =  —  D  are  the  elastic  constant 
and  the  dissociation  energy  of  the  molecule,  respectively,2 

To  find  the  energy  levels  of  the  molecule  (and  thus  its  energy 
spectrum),  we  take  the  Schrbdinger  equation  (11.21)  for  the  radial 
part  of  the  wave  function,  since  in  our  approximation  the  potential 
energy  (12.46)  possesses  a  spherical  symmetry. 

Since  we  are  interested  only  in  the  relative  motion  of  the  atoms, 
we  replace  the  mass  m  by  mred  in  Eq.  (11.21).  The  resulting  equa- 
tion is 


=  om  (12.47) 

Noting  that 

v*=S  +  }f  =  }^'  <12-48> 

and  introducing  the  function 

rR  =  u,  (12.49) 

we  obtain  after  substituting  (12.46)  into  (12.47) 


-  =  0.       <12.49a) 

Since  x<^a,  we  may  assume  in  this  equation  that  ~  =         .  3  ^  ~  . 
Then  setting  r       (a+*>       a 


£  +  D  —  £#/(/  -f  1  )  =  £',  (12.50) 

where  B  =  ^  and  ^==wred«2,  we  reduce  Eq.  (12.49a)  to  the  form 


_     te 

This  equation  is  exactly  the  same  as  Eq.  (10.14)  for  the  harmonic 
oscillator,  and,  therefore, 


t  (12.52) 

where  6  =  0,  1,  2,  3,... 


2 The  dissociation  energy  (-£))  is  defined  as  the  work  required  to  split  the  molecule 
into  atoms  (neglecting  the  vibration al  energy).  This  energy  is  usually  of  the  order  of 
several  electron  volts. 
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Thus,  when  we  consider  both  the  vibrational  and  the  rotational 
motion,  the  energy  of  the  molecule  is 

£  =  — D  +  fl*/ (/+!)+ Aw  (ft +  */«)•  (12.53) 

The  first  term  here  represents  the  dissociation  energy,  while  the 
second  and  third  terms  describe  the  rotation  and  vibration  of  the 
molecule,  respectively. 

We  note  that  the  molecule  has  only  a  finite  number  of  discrete 
energy  levels,  since  it  is  dissociated  when 


Qualitatively,  the  dissociation  of  a  molecule  at  large  quantum 
numbers  can  be  explained  as  follows.  When  k  •-->  1,  the  amplitude  of 
vibration  may  become  so  large  that  the  atoms  will  in  effect  stop 
interacting  over  this  distance  and  the  molecule  will  cease  to  exist 
as  a  bound  system.  Moreover,  if  the  orbital  angular  momentum 
numbers  /  which  characterize  the  rotational  energy  are  too  large, 
centrifugal  forces  can  also  split  the  molecule. 

We  shall  now  proceed  to  investigate  the  vibrational-  rotational 
spectrum.  We  shall  assume  that  the  spectrum  is  determined 
primarily  by  the  vibrational  energy  of  the  oscillations  because  its 
order  of  magnitude  is  larger  than  the  rotational  energy  (Avib  ~  lOjj, 
and  Xrot~  100  p.).  We  must  bear  in  mind  that  spontaneous  transitions 
can  occur  only  from  higher  to  lower  energy  levels,  that  is,  with  a 
change  from  k  to  k  —  1  [in  accordance  with  the  selection  rules,  the 
quantum  number  /  may  change  to  either  lower  (/  ->/  —  1)  or  higher 
(/->/  +  !)  values].  Then  for  the  frequency  of  the  radiation 

tu,^/:(V)-/:(fe    -1,  /j:l) 

n 
we  obtain  from  (12,53) 

u/  =  ft)-|-o)/r.  (12.54) 

Here,  in  accordance  with  (12.34)  and  (12.35),  we  have  u>,  l_l  =  2Bl, 
!,  and 


Thus,  two  branches  are  obtained  (see  Fig.  12.5): 

w*  =  wvib  +  2Bi  and  or  =  wvib  —  25  (/  +  1  )  (12.55) 
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Such  vibrational- rotational  spectra  are  observed,  for  instance, 
in  HC1  and  CO  molecules,  and  the  investigation  of  vibrational- 
rotational  spectra  is  of  great  importance  for  the  study  of  molecular 
structure.  From  these  spectra  it  is  possible  to  determine  various 
properties  of  molecules,  such  as  their  moments  of  inertia  and  the 
isotopic  composition  (since  the  moments  of  inertia  of  two  molecules 
consisting  of  different  isotopes  of  the  same  element  will  be  some- 
what different). 


Negative  branch 


Positive  branch 


l-f 


Fig.  12.5.     Vibrational-rotational    spectrum    of    a 
diatomic  molecule 

To  conclude  this  section,  let  us  consider  the  spectrum  of  a 
molecule  when  one  of  the  atoms  is  in  an  excited  state.  In  this  case 
the  vibrational- rotational  radiation  is  accompanied  by  the  transi- 
tion of  one  of  the  atomic  electrons  from  an  energy  level  n  to  a 
lower  level  n'. 

The  energy  of  such  a  molecule  can  be  written  in  the  form 

£M  =  £„-(-£,  4- E/f  (12.56) 

where  En  is  the  energy  of  the  excited  atom.  For  hydrogen,  En  is 
given  by  the  Balmer  formula  (see  Chapter  2) 


(12.57) 


The  energies  of  the  vibrational  and  rotational  motions  are,  respec- 
tively, 


and 


(12.58) 


£,  =  £/»(/+!).  (12.59) 

As   a    result  of  the  transition,  the  energy  EM*  of  the  molecule 
changes  and  becomes  equal  to 


EM'  =  En>  -f-  Eh'  -f-  EI>  . 


(12.56a) 
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Since  the  main  part  of  the  radiant  energy  is  now  due  to  the  electronic 
transition  n  ->  n'  in  the  atom,  the  quantum  numbers  k  and  /  may 
either  increase  or  decrease 

K  =  k±\,     /'  =  /iLl.  (12.60) 

Regardless  of  the  changes  in  k  and  /,  the  overall  change  in  energy 
is  a  decrease  because  of  the  emission  of  radiation  due  to  the 
electronic  transition. 

This  case  is  characterized  by  a  further  important  feature, 
namely,  the  strong  dependence  of  the  energy  bonding  the  atoms  in 
a  molecule  on  the  number  of  the  particular  shell  in  which  the  elec- 
tron is  located.  Therefore,  a  transition  causes  a  change  in  the 
bonding  energy,  which  in  turn  leads  to  a  change  in  the  interatomic 
distance  a.  In  transitions  from  an  excited  to  the  ground  stated  the 
distance  generally  increases,  together  with  the  moment  of  inertia 
y=mreda2,  whereas  the  quantity  B  =  ~  decreases.  Owing  to  the 

change  B-+B',  there  is  an  additional  slight  change  in  the  rotational 
part  of  the  energy,  which  now  becomes 

£r  =  B'ft/' (/'+!),  (12.59a) 

where  B'<B  in  the  present  case. 

Taking    into    account   all  possible  vibrational   and   rotational 

transitions,  we  find  that  the  frequency  of  radiation  UJ;M  =  — — —  is 

(12.61) 

where 

«M.  =  fl/(/  +  l)  —  BT  (/'+!).  (12.62) 

With  the  notation  o)0  =  — *~  v  ±«>,  we  can  write  Eq.  (12.61)  as 

u>M  =  o>0-f-a>/fli.  (12.61a) 

Finally,  we  obtain  three  frequency  branches  for  the  molecular  band 
spectra 

(12.63) 
(12.64) 
(12.65) 

In  these  formulas  the  first  or  positive  branch  (the  R  branch) 
corresponds  to  downward  transitions  between  the  rotational  levels, 
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and  the  second  or  negative  branch  (the  P  branch)  to  upward  transi- 
tions. The  third  branch  (called  the  zero  branch  or  Q  branch)  appears 
when  there  are  no  transitions  between  rotational  levels;  it  is  due 
entirely  to  changes  in  the  moment  of  inertia  caused  by  transitions 
within  the  atom. 

Using  Eq.  (12.62)  we  can  represent  u>+,  ur  and  u>°  in  the  form 


u>+  =  u>0-f  (B  —  B' 


(12.63a) 


(12.64a) 


(12.65a) 


These  branches  are  depicted  in  Fig.  12.6,  where  the  frequency  is 
plotted  along  the  abscissa  and  the  orbital  angular  momentum 
quantum  number  /  along  the  ordinate.  It  can  be  seen  from  this 
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Fig.   12.6.  Molecular  band  spectra:   *>+  -  the 

positive  branch;    oT  -  the  negative  branch; 

f*P  —  the  zero  branch. 

diagram  that  the  superposition  of  the  rotational  lines  o>/t  f,  on  the 
electronic-vibrational  line  o)0  gives  rise  to  a  whole  band  of  lines 
with  a  sharp  edge  on  the  left  and  a  diffuse  edge  on  the  right.  This  is 
in  complete  agreement  with  the  experimental  facts. 
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^Problem  12ffiy  Find  the  explicit  form  of  the  spherical  harmonics  for  the  cases  when 
the  oTBitai  angular-momentum  quantum  number  is  /  =  2  and  /  =  3.  Verify  their  ortho- 
normality  by  direct  calculation.  Plot  a  graph  of  I  Xj»  |8  for  in  =  0,  ±  1,  ±  2. 
Answer. 


Kf-  '  =  ±  sin  »  cos  »«±  '», 


--  J~  sin  9  (5  cos*  »  -  1 


Problem   ^2.^1  Find  the  eigenfunction  of  the  operator    L.v,  given  that  its  eigenvalue 
is  2ttlU  ftliJ  the  orbital  angular  momentum  quantum  number  is  /=  1. 
Solution.  Let  us  look  for  a  solution  of  the  equation  for  /  =  1  and 


in  the  form 

^Ci 
Using  (11.88),  with  /=  1  and  m  =  +  1,  —  1,   0,  and  applying  the  normalization  condition 

f  <p*t|;sin  to/Orf?  =1, 
we  find 


Investigate  the  general  form  of  the  motion  of  a  free  particle  in  spherical 
coordfnalfes.  juetermine  the  normalized  functions  for  /  =  0(s  states). 

Solution,    The  Schrodinger  equation  for  a  free  particle  in  spherical  coordinates  is 
written  as 


where 


"The  problems  in  this  chapter  apply  also  to  Chapter  11. 
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Introducing  the  new  function  X=  yr  /?,  we  transform  Eq.  (12.79)  to 

1  \*\ 
1 


The  solution  of  this  equation  is  a  Bessel  function  of  half-integral  order.  Since  the 
wave  function  must  remain  finite  as  r  —  0,  we  retain  only  the  Bessel  function  of  the  first 
kind.  Then 


and  for  a  free  particle  with  a  given  energy  the  general  solution  of  the  wave  equation  in 
spherical  coordinates  can  be  written  as 

nn  / 

1      '          "~*  (12.67) 


/  —  Om«-/  2 

In  particular,  for  /  =  0  (s  state),  we  have 

C    •  (12  68^ 

Since  the  spectrum  is  continuous,  this  expression  must  be  normalized  by  a  8  function 
oo 

f  /?*  (fc'f  r)  R  (*,  r)  r*dr  =  » (k  —  A'). 
oj 


Hence,  we  find;  C=  ]/  ~~  . 


V  Problem  12,4^  Find  the  selection  rules  for  quadrupole  radiation  from  a  rotator. 

iolTOon.  Ttis  necessary  to  find  the  transitions  for  which  the  matrix  element  of  the 
quadupole  moment 


<?«•  =  *  J  *Pm'  (3^s^'  -  *2^') 
=  '(3  J  Vf4fm't/mrfQ-«V  J 


does  not  vanish.  Here 

s,  s'  =  1,  2,  3;  *8  =  z  =  a  cos  ft, 

,vlf3  =  ^r  ±  ly  =  a  sin  ^  **  =  {  -~  . 

Since  the  matrix  element  of  a  bilinear  combination  can  be  expressed  as  the  sum  of  the 
products  of  linear  matrix  elements  (see  Problem  10.4), 


m",  /" 

and,  moreover,  it  follows  from  Eqs.  (12.24),  (12.25)  and  (12.26)  that  the  only  nonvanishing 
linear  matrix  elements  are  of  the  form 

e/  ±  1.  m  4-  '     ,/  ±  1,  «  —  1      ,'  ±  !•  « 
*/,  m  »     '/,  m  »    "/,  m       » 

we  obtain  the  following  selection  rules  for  the  quadrupole  radiation  from  a  rotator: 
Am  =  0,    ±.  1,    ±2;   A/  =  0,    ±2. 
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As  we  shall  see  later,  the  parity  of  a  spherical  harmonic  is  determined  only  by  the 
value  of  /  and  Is  independent  of  m  (see  Chapter  17).  Consequently,  for  dipole  radiation, 
the  only  allowed  transitions  are  from  an  odd  state  to  an  even  state  or  vice  versa  (A/  =  ±.  1 ), 
whereas  for  quadrupole  radiation,  the  transitions  can  occur  only  between  even  states 
or  between  odd  states. 


Problem  12. 5.)  Expand  the  plane  wave 

t|/  =  e'*"  =  *'*rcos*  (12.69) 

in  terms  of  spherical  waves. 

Solution.  Introducing  the  notation  kr  ~y  and  cos  0  =  r,  let  us  look  for  a  solution  in  the 
form  of  an  expansion  in  Legendre  polynomials 


Using  the  orthonormality  condition  for  Legendre  polynomials  [see  (11.68)],  we  find 

i 


Substituting  the  expression  for  the  Lengendre  polynomials  as  given  by  Eq.  (11.59)  and 
transferring  the  derivative  from  the  function  ^x*  __  \ y  to  the  function  ei^x1 1  times  (that 
Is,  integrating  by  parts  /  times),  we  obtain 


i 

"f")  llyl  J  (1  ~  *W  dX' 


Bi  = 

Z'/   \     '      £  I    - 

Using  the  well-known  relation  from  the  theory  of  Bessel  functions4 
i 


J  (i  -*.)V^=  yu/i  (  Ly+"VJ+l/3(y). 


we  determine  the  coefficients  /?/.   The  desired  expansion  of  a  plane  wave  in  terms  of  the 
spherical  waves  is  now  written  as 


P,(cos»).  (12.70) 


Let  us  emphasize  that  the  plane  wave  eikz  satisfies  the  Schrodinger  equation  for  a  free 
particle 


and,  therefore,  the  right-hand  side  of  Eq.  (12.83)  also  represents  a  linear  combination 
of  particular  solutions  of  the  above  equation  written  in  spherical  coordinates.  This 
explains  the  expected  proportionality  between  the  coefficients  Bi  and  the  Bessel  functions 


4See,    for  example,    P.    M.   Morse   and  H.    Feshbach,    Method?   of  Theoretical  Physics 
part  I,  p.  572,  McGraw-Hill  Book  Co. 


Chapter   13 

The  Theory  of  the  Hydrogen-like  Atom 
(Kepler's  Problem) 


The  Bohr  theory  of  the  hydrogen-like  atom  (see  Chapter  3)  is 
semiclassical  in  nature  and  can  only  provide  a  very  incomplete 
explanation  of  some  of  the  basic  properties  of  the  atom.  It  does  not 
enable  us  to  calculate  the  intensity  of  radiation  emitted  by  an  atom 
or  to  construct  a  theory  of  an  atom  with  more  than  one  electron. 
The  wave- mechanical  theory  of  the  atom  is  able  to  deal  with  these 
problems  without  any  fundamental  difficulties.  The  problem  of  the 
hydrogen-like  atom  presents  in  addition  a  certain  methodological 
interest  since  it  can  be  solved  exactly,  like  the  harmonic  oscillator 
and  rotator  problems.  Mathematically  this  problem  can  be  regarded 
as  a  generalization  of  the  classical  problem  of  the  motion  of  a 
planet  around  the  sun  (Kepler's  problem). 


A.    ENERGY  EIGENFUNCTIONS  AND  EIGENVALUES 
The   energy   of  interaction  between  an  electron  and  a  nucleus 

V  =  -Q  (13.1) 


depends  only  on  the  distance  r  between  them.  The  problem  of  the 
hydrogen-like  atom—  a  single  electron  moving  about  a  nucleus—  is, 
therefore,  a  typical  example  of  motion  in  a  central  field  of  force. 
If  the  origin  of  the  coordinate  system  is  at  the  center  of  the  nucleus, 
we  may  regard  the  angular  part  Y?  of  the  wave  function  as  known 
[see  Eq.  (11.67)]  and  find  the  energy  levels  and  the  radial  part 
/?(/•)  from  Eq.  (11.21),  which  for  the  present  case  is  written 
as 
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Let  us  introduce  the  effective  potential  energy  of  the  electron1 


e(f 


2/rtor31 


(13.3) 


where  the  first  term  represents  the  Coulomb  interaction  and  the 
second  the  centrifugal  forces. 


E<0 


Fig.   13.1.    The    effective    potential    energy    (solid   line)    as    a 
function  of  distance 


The  dashed  line  shows  the  behavior  of  the  wave  function 

A  graph  of  Ven    is  given  in  Fig.  13.1.  This  graph  shows  that  if 
the  total  energy  of  an  electron  is  negative  (£<0),  it  moves  within 


lWe   shall   attempt  to  interpret  Eq.    (13.3)  from  the   standpoint   of  the  classical  theory, 
using  the  classical  relationship 


A. 

2m  o 


2m  Or2 


(13  3a) 


Since  p    -  constant  for  central  forces,  we  may  write 


To  generalize  this  expression  for  the  quantum  case,   we  must  replace  p     by  its  quantum- 
mechanical  value  p£  -  tl2l(l  \    1)    In  the  same  way  we  can  regard  the  expressio 


in  Eq.  (13  2)  as  corresponding  to  the  term  pj?/2m0  in  the  classical  theory 


n  —  1~[—  V  ] 
2m0  y       ) 
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a  region  bounded  on  both  sides  by  potential  barriers  (the  case  of 
elliptic  orbits  in  the  classical  analog),  so  that  the  energy  spec- 
trum will  be  discrete. 

For  E  >  0,  there  is  no  barrier  on  the  right  (r  --  co)  and  the  posi- 
tion of  an  electron  can  range  to  infinitely  large  R  (the  case  of 
hyperbolic  orbits  in  the  classical  analog).  Since  the  electron's 
position  in  the  atom  must  be  bounded  by  a  certain  rmax  (elliptic 
orbits),  it  is  necessary  to  assume  that  £<0  in  order  to  develop 
a  theory  of  the  atom.  Accordingly,  Eq.  (13.2)  becomes 


where 

*°*/A  =  B  >  0  and  —  ^j-  ==  A  >  0.  (1 3.5 ) 

Introducing  the  new  variable 

P  =  2M"r,  (13.6) 

we  obtain  the  equation 

'    P          '    I       4     '      tfA  p  P2      '      ~~"    ' 

where  R'  =  (dR,'dp) . 

From  the  graph  of  Veff,  we  can  see  the  general  nature  of  the 
solution.  Clearly,  inside  the  well  (>'min<C/'<C'"max)  the  solution  will 
be  oscillatory,  whereas  outside  the  well  (/•--*()  and  r— »oo) 
there  are  two  solutions  (on  each  side) :  one  that  tends  to  zero  and 
a  second  that  tends  to  infinity.  To  make  the  solution  an  acceptable 
wave  function,  we  must  impose  on  it  restrictions  that  will  elimi- 
nate the  solution  that  tends  to  infinity.  Just  as  in  the  problem  of 
the  harmonic  oscillator,  it  turns  out  that  this  requirement  can  be 
satisfied  only  for  certain  discrete  values  of  the  energy  of  the 
electron. 

Since  the  potential  well  is  not  symmetric,  we  shall  look  for 
asymptotic  solutions  for  p— *0  and  for  P--CO  separately.  The 
asymptotic  solution  for  p-*oo  may  be  found,  according  to  Eq.  (13.7), 
from 

/Ji—lRa^O,  (13.8) 

which  gives 

P.  (13.9) 
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To  eliminate  the  exponentially  increasing  solution,  we  must 
set  Cj  =  0.  Coefficient  Q  can  be  included  in  the  normalization 
coefficient  of  the  wave  equation  and  can,  therefore,  be  set  equal  to 
unity.  We  then  have 

/?„  =  *-'/*.  (13.10) 

The  asymptotic  solution  for  p—  -0  can  be  obtained  from  Eq. 
(13.7),  which  in  this  case  reduces  to  the  form 

#o  =  0,  (13.11) 


Setting    /?o  =  p*.    we    find    q(g  +  !)  —  /(/  +  1)  =  0,    and  thus    </i  =  /, 
ft  =  —  (/  -f  i  ).  Consequently, 

/?.  =  ClP'  +  Cap-/-1.  (13.12) 

Setting   C*  =  0   (to   exclude   the    solution   increasing   to   infinity  at 
p  =  0  )  and  Cj  =  lf  we  obtain 

/?O  =  P'-  (13.13) 

Equation  (13.7)  can  be  rewritten  as 


and  we  assume  its  general  solution  to  be  of  the  form 

.  (13.14) 


In  this  case 

p/?  =  p<+  V/apa  =  vu,  (13.15  ) 

and  for  an  unknown  function  u  we  obtain  the  equation 


According  to  (13.15), 

lno  =  - 
and,  therefore, 

-^  +  '-ii. 
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Furthermore,  we  have 

<,=_<+!„+(_  j+£±I)V  and  i_i_!±l+£^Jl. 

Using  these  formulas,  we  transform  (13.16)  to 

=  0.  (13.17) 


If  the  behavior  of  the  solution  for  R  is  to  be  determined  at  the 
origin  and  at  infinity,  by  the  asymptotic  formulas  (13.10)  and 
(13.13),  we  must  find  the  conditions  under  which  the  function  u 
will  be  a  finite  polynomial  of  degree  k  without  negative  powers: 


Substituting    (13.18)   into    (13.17)   and  collecting  powers  of  pf  we 
have 


(13.19) 
Since  aft+J  =  0  and  a*  ^  0,  we  obtain 

tj        I.     I     I     I      1  *i  /I  Q    OflV 

— —  —  /j-j-/-)-.  1  =H.  (iG.^U) 

VA 

Here    the  quantum  number   n,  which   is   equal   to  the  sum  of  the 
orbital  angular  momentum  quantum  number 

/==0,   1,  2,  3,... 
and  the  radial  quantum  number 

*  — 0,  1,  2,  3,.,.  (13.21) 

plus  unity,  is  called  the  principal  quantum  number  n.  It  may  assume 
the  values 

n—  1,2,3 (13.22) 

To  determine  the  unknown  coefficients  av  of  the  series  (13.18),  we 
derive  a  recurrence  relation  with  the  a  id  of  Eqs.  (13.19)  and  (13.20): 

(13.23) 


208  NONRELATIVISTIC    QUANTUM    MECHANICS 

Setting  the  coefficient  of  the  highest  power  in  (13.18)  ak=  (—  1)"  and 
calculating  all  the  remaining  coefficients  with  the  help  of  (13.23), 
we  obtain  the  following  expression  for  the  function  u: 


2, 


y=o 


where  s=2/  +  1  .    The   series    (13.24)   is  an  associated  Laguerre 
polynomial  Q£(p)  of  order  k  and  may  also  be  represented  in  closed 


form2 


u  =  Ql  (p)  =  e?p~s  £  (e-V**).  (13.25) 

Thus,  for  the  radial  function  Rnl(r)  we  finally  obtain 

«»i  (P)  =  C^-'/V  <£ +,'_,  (P),  (13.26) 


where    p  =  2|A/ir.     Recalling   that    —  —  =  n,    and    substituting   the 
value  of  fl  from  Eq.  (13.5),  we  find 


where  a0  =  ~r~T  is  the  radius  of  the  first  Bohr  orbit.  Calculating 

WO^Q 

the  coefficient  Cnl    from  the  normalization  condition  (see  below), 
we  obtain 


(13.28) 


We  shall  show  here  that  the  function  u,  written  in  the  closed  form  (13  25),  satisfies 
Eq.  (13  17)  The  function  v  -=  e~ppk*s  satisfies  the  equation  pu' +  (p—  k  -  s)v  -  0,  as  can 
be  easily  verified  by  taking  the  first  derivative  of  v  with  respect  to  p  Differentiating  this 
equation  k  t  1  times  according  to  Leibnitz's  rule,  we  reduce  it  to  the  form 


Introducing  the  new  function  &       v     \>Pp~* ,  we  obtain  the  differential  equation 
pw     4  (s  f  1  -  0)u;'  f  fcu>   --    0  , 

which   is  identical  with  Eq    (13  17)  for  the  function  u  (since  (/J''^A)  —  /-  1  =  k].   Since  it 
can  be  readily  shown  that  the  coefficient  of  the  leading  term  (p  )  of  the  function 

—  ) 

is  identical  with  the  corresponding  coefficient  of  (13  24),  we  have  demonstrated  that  the 
relation  (13.25)  is  correct 
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Therefore,  the  radial  wave  function  is 

• I I          I /     T— — .  v.   . : — rrr-  I  — "  I  6       °   (Jn  —  I  —  \  I      ~~~  ]  •  /-I  o    O O n  \ 

\m*o  /  (lo.Aoaj 


As  we  know,  the  normalization  condition  for  the  radial  part  of  the  wave  function  is 


dr  =  1. 


Substituting   the  expression  for  Rni  from  (13,26)  and  replacing  r  by  9~  pf  in  accord- 
ance with  (13.27),  we  obtain  ^ 


Now  let  us  represent  one  of  the  polynomials  Q'  as  a  series  (  13.24  )t  leaving  the  other 
polynomial  in  the  closed  form  (13.25JL  The  normalization  condition  then  becomes 


Applying  the  theorem  for  the  transfer  of  a  derivative   (see  Chapter  7,  Section  C)  to  this 
equation,  we  find 


It  is  obvious  that  the  remaining  terms  in  the  series  representing  the  function  QjJ  vanish, 
since  the  order  of  the  successive  derivatives  is  higher  than  the  corresponding  powers 
of  p.  Using  the  well-known  integral 

00 

(13.29) 

we  obtain  the  expression  (13.28)  for  Cnl  „  In  a  similar  fashion,  we  can  determine  the 
average  value  of  (r~v)  p»=  1,  2,  <*,  4>,  which  will  be  useful  in  the  later  development: 

00 

/» 

On  the  basis  of  the  above  equations  we  can  rewrite  this  as 

CO 

(7-*)=CL  ("%}*  ("*'}  *   {  P~v+1(—  l)*{p*~ *(*• 

v  nl\2Z  I    \2Z  J       y        i       /   if  v 

0 

.1)1. 
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Setting  v   —    1,  2,  3  and  4  in  the  last  expression,  and  using  once  again  the  theorem  for 
the  transfer  of  a  derivative,  we  obtain  after  a  few  simple  operations 


(13'29a) 


-/(/+D 


In  calculating  r~!t  we__retain  only  the  leading  term  p*  in  the  polynomial  @J.  On  the 
contrary,  in  calculating  r~8,  we  retain  only  the  last  term  p°.  For  r~8,  we  retain  the  last 
two  terms,  and  so  on.  The  expressions  for  r~a  and  r~*  are  obtained  under  the  assump- 
tion that^  /  ^_0_  For  the  s  states  (/  =  0),  interactions  which  are  usually  proportional  to 
either  r~a  or  /-~4  are  generally  replaced  by  a  contact  interaction  (see  below,  Chapters  19 
and  20). 

To  supplement  this  general  treatment,  we  shall  calculate  the 
normalization  coefficient  of  the  wave  function  ^n/OT=/?n/K/OT  for  the 
ground  (or  lowest)  energy  state,  which  is  characterized  by  the 
quantum  numbers 


According  to  Eq.  (12.9),  the  spherical  harmonic  Y",  is  a  constant 
for  /  =  m  =  0,  as  is  the  polynomial  Q^  '  for  k  =  0m  Consequently, 
from  (13.26)  we  have  for  ^10o 

j> 
<hoo  =  GT  a»>  (13.30) 

and,  therefore,  the  normalization  condition  takes  the  form 

j  *:..ti*d'A-=  J  «..*i^2^i2  -  1.  (13.31) 

Substituting  the  explicit  form  of  the  wave  function  ^100  and  bearing 
in  mind  its  independence  of  the  angles  d  and  9,  we  obtain 


00 


J_2Zr 
o   e   "fl"°>dr  =  l.  (13.31a) 

2/^r 

Introducing  the  variable  .v=--^-  and  using  (13.29),  we  find 

U 


The  same  value  for  C  can  also  be  obtained  from  the  general  equa- 
tion (13.28)  by  setting  n  =  1  and  /  =  0.  Therefore,  the  wave  function 
for  the  ground  state  is 
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' 


The  energy  spectrum  of  the  hydrogen-  like  atom  is  found  from 
Eqs.  (13.20)  and  (13.5): 


where  /?  is  the  Rydberg  constant 


This  expression  for  the  energy,  which  we  note  is  in  complete 
agreement  with  the  corresponding  expression  of  the  Bohr  theory 
[see  Eq.  (2.50)],  depends  only  on  the  principal  quantum  number 
/j  —  /_]-,&  _|_  i  (that  is,  only  on  the  sum  of  the  orbital  and  radial 
quantum  numbers  /  and  k)  and  is  independent  of  the  magnetic 
quantum  number  m.  At  the  same  time,  the  wave  function  tynim  =  RniYf 
depends  on  all  three  quantum  numbers  //,  /  and  m  individually. 
Consequently,  it  follows  from  the  Schrbdinger  wave  theory  that  the 
energy  levels  are  degenerate.  Since  each  value  of  /  can  vary  from 
0  to  ;i—  1  and  m  can  vary  from  —  /  to  -\~l  [see  (13.20)],  the  degree 
of  degeneracy  is 

n-l      /  n-1 


It  was  shown  in  Chapter  12  that  the  degeneracy  with  respect  to 
m  is  characteristic  of  all  central  fields  of  force  and  is  related  to 
the  fact  that  there  is  no  preferred  direction  passing  through  the 
origin.  The  degeneracy  with  respect  to  the  orbital  angular  mo- 
mentum quantum  number  /  appears,  however,  only  in  the  case  of 
pure  Coulomb  interaction.  In  most  other  centrally  symmetric 
systems  there  is  no  /  degeneracy;  that  is,  the  energy  level  for  a 
given  value  of  n  is  split  into  n  sublevels  corresponding  to  different 
/.3  If  the  system  is  placed  in  an  external  field  (for  example,  a 
magnetic  field)  which  removes  the  central  symmetry,  the  de- 
generacy with  respect  to  m  also  disappears.  In  this  case,  the  nth 
energy  level  is  split  into  ri*  distinct  sublevels. 


3In  particular,  as  we  shall  see  later,  the  degeneracy  with  respect  to  /  vanishes  even  in 
the  case  of  the  hydrogen  atom  if  we  take  into  account  the  relativistic  effects,  the  nuclear 
volume,  and  the  so-called  vacuum  corrections  Similarly,  in  the  spectrum  of  alkali  metals, 
which  have  one  valence  electron  in  the  outer  shell,  the  influence  of  the  electrons  in  the 
inner  shell  removes  the  degeneracy  with  respect  to  / 
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B.    SEMICLASSICAL  INTERPRETATION  OF  THE 
PRINCIPAL  RESULTS  OF  THE  QUANTUM-MECHANICAL 
THEORY  OF  THE  HYDROGEN-LIKE  ATOM 

In  classical  theory  the  quantity 

(13.34) 


[see  (13.3a)]  must  be  greater  than  zero.  For  elliptic  orbits 
(E  =  —  |  E  |  <0)  it  is  seen  that  this  is  possible  only  when  the  radius 
r  lies  within  certain  limits  ('mm^  r  ^  rmax)  which  can  be  found  by 
setting  the  right-hand  side  of  (13.34)  equal  to  zero.  Using  the 
quantum-mechanical  expression  for  the  energy  (13.33),  we  find 


<13-35) 


The  equation  of  an  ellipse  in  polar  coordinates  is 

r==  7+7^1-  <13-36> 

b2 

where  the  parameter  P  =  ~^  is  defined  as  the  ratio  of  the  square  of 
the  semiminor  axis  b  to  the  semimajor  axis  a,  and  the  eccentricity 
e  =  — -  — —  characterizes  the  elongation  of  the  ellipse  (for  s  =  0, 

the  ellipse  becomes  a  circle).  From  (13.36)  we  readily  obtain 
equations  for  rmaxand  rmin: 

rmax  =a(l  +e), 

r       -n(\        ^  <13'37> 

'mm — "V1       E;- 

Comparing  Eqs.  (13.37)  with  (13.35),  we  find 

(13.38) 


We   see  that  the  classical  analog  of  the  quantity  — -  turns  out  to 

be  the   semimajor  axis  of  an  ellipse  whose  eccentricity  is  given 
by  Eq.  (13.39). 


THE    THEORY    OF    THE    HYDROGEN- LIKE    ATOM  213 

Substituting    into    (13.39)  the  Bohr  value  of  pj,  =  Hlnl  =  Hl(l-^  i)» 
and  then  the  quantum- mechanical  value  of  pj  =  **/(/+  1),  we  obtain 


(13.40) 
Vn=V    >-'U,J-'  <13-41> 

From  this  it  is  clear  that  the  eccentricity  becomes  exactly  zero 
only  in  the  Bohr  theory  (l  =  n—  1)  In  wave  mechanics,  for  l  =  n —  1 
the  eccentricity  has  a  minimum  nonzero  value  given  by 

s111111^!/"!.  (13.42) 

quan       \     n  ^ 

This  shows  that  in  quantum  mechanics,  we  can  speak  of  the  clas- 
sical analog  of  states  with  circular  orbits  only  when  the  quantum 
number  /  has  the  value  n  —  1  Furthermore,  it  should  be  noted  that 
in  the  classical  approximation,  an  ?>  state  (/  =  0)  gives  w££~  1, 
which  corresponds  to  a  parabolic  orbit.  This  case,  however,  cannot 
be  associated  with  a  parabolic  orbit  from  the  quantum-mechanical 
point  of  view.  As  we  can  see  from  Eq.  (13.33),  in  quantum  mechanics 
the  energy  is  negative  for  all  values  of  n  (we  recall  that  in  the 
classical  case  the  total  energy  is  zero  for  parabolic  orbits);  con- 
sequently, the  radius  for  /  =  0  is  limited  only  by  its  maximum 
value 


This  lack  of  agreement  between  the  quantum-  mechanical  and 
classical  solutions  for  the  *>  state  simply  means  that  the  case  /  =  0 
has  no  classical  analog. 

In  general,  we  are,  of  course,  entitled  to  speak  only  of  the 
probability  of  particular  events  in  the  context  of  wave  mechanics. 
Therefore,  all  the  results  obtained  from  the  Schrbdinger  theory 
must  be  interpreted  in  terms  of  probability  considerations.  Let  us 

show,  for  instance,  that  the  Bohr  radius  a  =  -£*~  [see  Eq.  (13.38)] 

for  the  case  of  circular  orbits  (/  =  /*  —  1)  corresponds  in  wave 
mechanics  to  the  most  probable  value  of  the  position  coordinate 
of  the  electron,  r  =  a. 

According  to  the  normalization  condition 


the  distribution  of  radial  probability  density  D(r)  is 

r!Rli.  (13.43) 
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In  the  case  of  circular  orbits,  when  l  =  n—  1  and  6  =  0,  Eq.  (13.43) 
gives 

/-*/?»,„__,,  (13.43a) 


According  to  Eqs.  (13.25)  and  (13.26), 

Rn  irt_,  =  const  e    2     p"'1, 

and,  therefore,  we  find  the  following  expression  for  D(r)  (see  Fig. 
13.2): 

D  (r)  =  const  fni  ~™°  •  (13.44) 

Determining  the  value  of  r  at  which  this  function  has  a  maximum: 

fdD(r)\       =0j 

we  obtain 

rn  =  a="'a9u  (13.45) 

It   is    interesting   to   note   that   if  we  set  Z—  1,  the  most  probable 
radius  r  is  the  radius  a0  of  the  first  Bohr  orbit. 


Fig.  13.2.  Radial  probability  density  distribution  function 
in  the  case  of  circular  orbits. 

If  the  radial  quantum  number  k  ^  0,  the  orbits  can  be  said  to  be 
elliptical   and  the  probability  distribution  D(r)  assumes  the  form 


Zr 


(13.46) 


The  equation  for  the  extremals  of  this  function  is 


=  const  f"e~*{  (21  +  2)  QfH  — 


(13.47) 
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Since  xQj^1  is  a  polynomial  of  the  £th  degree,  Eq.  (13.47)  has  k 
roots  (not  counting  points  r  =  0  and  r  =  oo)  and  fe+  1  maxima.  This 
case  is  very  similar  to  the  probability  distribution  for  the  motion 
of  a  free  particle  inside  a  potential  well  (sinusoidal  variation  of 
the  wave  function,  see  Chapter  4)  or  for  the  motion  of  an  oscillator 
(see  Chapter  10,  Fig.  10.2). 


C.    SELECTION  RULES,   EMISSION  SPECTRA  OF 
HYDROGEN-  LIKE  ATOMS 

To  determine  the  selection  rules  for  hydrogen-like  atoms 
(Kepler's  problem),  it  is  necessary  to  calculate  the  matrix  ele- 
ments 


Wm'  =  J  «'<•»•  f  W*-  (13.48) 

Substituting  here  tynlm~Y?Rnltwe  obtain 


00 

r  f  Yf  \  Rn>v  r*Rnldr  .  (13.48a) 

DJ 


As  we  know  from  Eqs.  (12.24),  (12.25)  and  (12.26),  integration 
over  the  angles  &  and  <p  gives  the  selection  rules  for  the  orbital 
angular  momentum  quantum  numbers  (A/  =  /  —  /'  =  :±1)  and  the 
magnetic  quantum  numbers  (Am  =  m  —  m'  =  0,  ±rl).  Using  these 
results  we  can  write  Eq.  (13.48a)  as 


<'.  '±1  fft.'.  tti  r>Rmdr.         (13.48b) 

J 

Evaluating  the  integral 

r'Rn,t.Rntdr  ~     ,»**'«  <?t  X 


[ 


4This  integral  can  be  evaluated  by  Introducing  the  new  variable  p  =  ~  (  -  -f  -~\  Then, 

±  aO  \n      n  J 

expressing  Qk  and  Q^/     in  the  form  of  polynomials!  we  can  perform  the  integration  term  by 


term,  It  is  found  that  only  the  integral 

oo 

vanishes  because  of  the  orthogonality  condition. 
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it  is  easily  shown  that  it  does  not  vanish  for  any  value  of  n'\  that 
is,  for  all  allowed  transitions  the  principal  quantum  number  can 
change  arbitrarily. 
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Fig.   13.3.    Spectral    series    of   the    hydrogen 
atom 

The   wavelengths   corresponding   to   the  indi- 
cated transitions  are  expressed  in  Angstrom 
units 


Having  obtained  the  selection  rules  for  the  hydrogen-like  atom, 
let  us  investigate  its  emission  spectrum.  We  shall  first  introduce 
certain  conventional  symbols  designating  the  energy  levels  of  an 
atom.  The  spectral  terms  ( —  Enl/h)  which  depend,  in  the  general 
case,  on  both  n  and  /  are  denoted  by  the  symbol  (nl),  that  is, 


(13.49) 


where n=  1,  2,  3,  ...,  and  I  is  replaced  by  one  of  the  letter  symbols 
s,  p,  d,  /,  gt  ht  .  .  corresponding  to  /  =  0,  1,  2,  3,  4,  5,  ...,  as 
indicated  in  Chapter  12.  Since  the  quantum  number  l^n—\9  the 
only  possible  terms  are  Is;  2s,  2p,  3s,  3,0,  M\  4s,  4p,  4d,  4/;  5s,  5p, 
5dt  5f,  5g;  and  so  on. 

There  cannot,  for  instance,  be  a  \p  term,  since  in  this  case  we 
would  have  n=  1  and  /=  1,  nor  can  there  be  a3/  term,  since  this 
would  give  us  n  =  /  =  3.  The  radiation  frequencies  expressed  in 
term  symbols  (nl)  have  the  form 
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(13.50) 


where  it  is  necessary  to  bear  in  mind  the  selection  rules  for  the 
orbital  angular  momentum  quantum  number  /,  namely,  V  =  l±_\. 
Using  Eq.  (13.33),  the  term  (nl)  can  also  be  represented  in  the 
form 


(13.51) 


where   R    is   the   Rydberg   constant  R  =  -£f-  ,    We  thus  obtain  the 
following  equation  for  the  radiation  frequency  ow : 

(13.52) 

In  the  case  of  the  hydrogen  atom(Z  =  l),  the  Lyman  series  (see 
Fig.  13.3),  which  corresponds  to  a  transition  to  the  lowest  energy 
level  n'=  \  (the  Is  level),  is  given  by 

(13.53) 

where   n  =  2,  3,  4,  ....    For   the   Balmer   series  (see  Fig.  13.4), 
which  corresponds  to  a  transition  to  the  level  n'  =  2  from  the  levels 
//^>2,    there    are    three    types    of 
allowed  frequencies: 


°^ 

«0 


(13.54) 

Since  the  energy  states  of  the  hydro- 
gen atom  are  degenerate  with  respect 
to  the  orbital  quantum  number,  these 

three      lines      merge     intO     One     (See 

Fig.  13.3),  and,  consequently, 


(13-55) 


Fig.  13.4.  Balmer  series. 

The  wavelengths  corresponding  to 
the  visible  lines  Ha,  Hfif  H-y  and 
HS  are  given  in  S  Angstroms  (A).  H^ 
gives  the  theoretical  position  of  the 
limit  of  the  series. 


A    similar  result  is  obtained  for  the  Paschen  series,  whose  fre- 
quencies (see  Fig.  13.3)  are  given  by 


en^^V^™"  "it"  J1 


(13.56 


where  n  =  4,  5,  6  , . , , 
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D.    MOTION  OF  A  PARTICLE  IN  A  COULOMB  FIELD 
IN  THE  CASE  OF  A  CONTINUOUS  SPECTRUM 

Although  the  wave  function  for  a  continuous  spectrum  can  be 
expressed  in  terms  of  a  confluent  hypergeometric  function,  the 
usual  procedure  adopted  in  the  investigation  of  the  hyperbolic 
solution  is  to  consider  the  asymptotic  behavior  of  the  hypergeo- 
metric function  for  large  values  of  r  These  asymptotic  solutions 
can  be  obtained  directly  with  the  help  of  the  quasi-classical  WKB 
method.  To  investigate  the  hyperbolic  orbits  (£  >  0),  let  us  first 
use  this  approximation  to  obtain  certain  general  results  applicable 
to  the  case  of  central  forces.  According  to  Eqs.  (11.21)  and  (11.51), 
the  equation  for  the  radial  part  of  the  wave  function  in  the  case  of 
central  forces  is 


=  Q,  (13.57) 

where 

u  =  Rr.  (13.58) 

In  the  WKB  method,  the  potential  energy  V  (r)  is  required  to  decrease 
more  slowly  than  r1  as  r->0.  Furthermore,  we  are  entitled  to 
choose  the  potential  energy  in  such  a  way  that 


Equation  (13,57)  has  a  singularity  at  the  point  /-->  0  associated 
with  the  term  1(1+  l)r\  The  effective  potential  energy  here  forms 
a  potential  barrier  of  infinite  height.  Consequently,  the  asymptotic 
forms  of  the  wave  function  which  we  obtained  earlier  for  con- 
necting the  WKB  solutions  across  a  slowly  varying  potential 
barrier  do  not  provide  a  good  approximation  in  the  present  case. 
Therefore,  in  order  to  use  the  WKB  method  in  the  case  of  central 
forces,  we  must  obtain  either  a  different  asymptotic  expression 
in  place  of  the  Bessel  functions  of  order  V3,  or  we  must  remove  the 
singularity  from  the  point  r  =  Q  to  the  point  x==  —  oo  by  introducing 
the  new  variable  x  =  \nr. 

We  shall  use  the  second  of  these  methods  and  introduce  a  new 
wave  function 


(13.59) 
Equation  (13.57)  then  takes  the  form 

d  V      t      _9.tr/2//Io    i-i          2/flo    tr/.v\          /l     i       1  \ QAM  ..          A  /I  Q    CQ\ 


to  which  the  WKB  method  is  applicable.  For  the  argument  z  which 
determines  the  asymptotic  behavior  of  ^  [see  Eq.  (5.55)],  we  obtain 
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z  = 


—  2maV(ex)  —  I 
-ijlpl*. 


dx-. 


(13.61) 


It  should  be  noted  that  here  \p\  no  longer  represents  the  momentum. 
Transforming  back  to  the  original  independent  variable  r>  we 
obtain 


dr.          (13.62) 


From  this  it  follows,  in  particular,  that  in  using  the  quasi-classical 
expression  for  the  one-  dimensional  radial  equation,  we  must  make 
the  following  change  in  the  orbital  angular  momentum: 


(13.63) 


Let  us   now   apply  Eq.   (13.61)  to  the  motion  of  a  particle  in  a 
Coulomb  field 


for  the  case  of  hyperbolic  orbits  (£>0).  In  the  quasi-classical 
treatment  of  the  problem,  it  is  necessary  to  use  the  connected 
asymptotic  solutions  (5.66)—  (5.69).  This  means  that  in  our  case 
[see  (13.62)]  we  must  evaluate  the  integral 


where 


(13.64) 
(13.65) 


and  the  value  of  rn  is  found  from  the  condition  /(r0)  =  0. 

First   let  us   determine   the   asymptotic    solution   of  the  wave 
function  for  r0  >  r  ->  0.   Substituting 

e  ~  (  I  -4-  *  \*     b—      2m°Ze»      a  —      2m«E 
c  —  <  +       •    &  —  --   —  .   a  —  --  ^ 


into  Eq.  (13.64)  and  introducing  the  notation 


(13.66) 
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we  obtain 


On  the  basis  of  Eq.   (5.51),  the  asymptotic  solution  for  this  case 
has  the  form 


where 


in  accordance  with  (13.61).  Since,  according  to  (13.58)  and  (13.59), 
the  radial  wave  function  R  is  related  to  x  by  the  equation 

*=^x.  <13-67> 

we  find 


In  order  for  the  solution  to  remain  finite  at  the  origin,  we  must 
set  D  =0;  that  is,  we  choose  the  solution  in  the  form 

/?  =  Cr<,  (13.68) 

which  is  in  complete  agreement  with  the  asymptotic  solution 
obtained  by  another  method  [see  (13.13)]. 

For    the    other    limiting   case    r-*oo,    we   have   the  following 
asymptotic  solution  [see  (5.67)  and  (5.66)]: 

f£).  (13.69) 

where,  according  to  (13.61),  p  is  equal  to 

P  =  rkk.  (13.70) 

Equation  (13.69)  is  an  analytic  extension  of  the  asymptotic  solution 
for  r->Q,  To  determine  the  quantity  z  from  (13.62),  we  use  the 
integral  (13.64)  with  the  following  substitutions: 
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(13.71) 
Then,  since  YJ  ^fcr  +  T-  we  obtain  at  r->oo 


&.    (13.72) 


We  shall  also  make  use  of  the  relations 


-  arc 


ilSLjr  arc  tan  — L-V          (13.73a) 


Here,  in  writing  the  imaginary  part,  we  took  account  of  the  fact 
that  7>0  and  f/-f  "^)>0»  and  therefore  the  angle  determining  the 

imaginary  part  of  the  logarithm  in  Eq.  (13.73)  lies  in  the  first 
quadrant,  and  the  angle  of  the  logarithm  in  Eq.  (13.73a)  in  the 
fourth  quadrant. 

The  expression  for  the  argument  z  may  now  be  transformed  to 


(13.74) 
where  the  phase  &/  is5 


f  =  -/  +      arc  tan^Lj— r!nl  + +T*  +7-     (13.74a) 


From  a  more  accurate  calculation  (a  calculation  of  the  asymptotic  expression  for  the 
argument  z  from  an  expansion  of  the  confluent  hyper  geometric  function),  the  following 
value  is  obtained  for  the  phase  ty  (see  P  M  Morse  and  H.  Feshbach,  Methods  of  Theo- 
retical Physics,  Part  II,  New  York:  McGraw-Hill,  1953): 


1  +  17) 

If  \l  +  iy\  !>  1,  then  using  Stirling's  formula 

R0         ,  — 

ru  *  f  +  17)  -  |r<i  +  /  f  »r)|e~/s/  -  V27 

we  obtain  for  the  approximation  (13  74a)  fy 
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In  particular,  when  /  =  0  and  Y>  1,  we  may  write  as  an  approxima- 
tion 


Hence,    taking   into   account    Eqs.  (13.69),  (13.67)  and  (13.70),  we 
find 


kr  —  — 


(13.75) 


l  . 

Here  C  is  the  normalization  coefficient,  and  the  total  phase  is 

2/fer.  (13.76) 

Setting  f~0  in  (13.76),  we  obtain  the  asymptotic  form  of  the  radial 
function  for  the  case  of  free  motion; 


(13.77) 


Thus  the  potential  energy  in  a  central  field  of  force  is  taken  into 
account  by  means  of  the  phase  shift 


/A3  -  ^  V  (r)  -  (/  + 1)'2  r*  dr.  (13.78) 

/"O 

where  rj  is  the  root  of  the  first  integrand. 

In  the  special  case  of  a  Coulomb  field,  this  phase  is  given  by 
Eq.  (13.76).  The  presence  of  the  logarithmic  term  depending  on  r 
in  the  phase  shift  is  the  result  of  the  long-range  character  of 
electrostatic  forces,  which  can  influence  the  particle  even  at  a 
very  large  r. 

The  phase  shifts    fy  are  functions  not  only  of  /,  but  also  of  It* 

(that  is,  of  the  energy  E  =  ^-~).  They  are  an  essential  character- 
istic of  the  eigenf unctions  of  the  continuous  spectrum.  The  value 
of  the  phase  shift  cannot  be  found  in  a  general  form.  The  quantity 
&,  has  to  be  determined  separately  for  each  specific  problem, 
usually  by  approximation  methods.  The  phase  shifts  are  of  particu- 
lar importance  in  scattering  problems,  where  they  are  used  to 
express  the  effective  cross  sections  (see  Chapter  29). 

The  wave  functions  of  a  continuous  spectrum  are  normalized  in 
terms  of  a  8  function.  The  expression  (13.75)  for  the  wave  function 
is  valid  practically  in  all  space;  there  is  only  a  small  region  near 
the  center  where  this  expression  takes  a  somewhat  different  form. 
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In  evaluating  the  coefficient  C  we  may,  therefore,  assume  that  Eq. 
(13.75)  gives  a  correct  expression  for  the  wave  function;  that  is, 
valid  in  all  space.  In  this  case,  the  wave  function  is  normalized  by 
the  relation 


We  substitute  the  expression  (13.75)  for  /?,(&)  and  neglect  the 
logarithmic  term  in  the  expression  for  the  phase  shift  8,  because 
it  increases  slowly  compared  with  r.  Consequently,  we  find 


—  ~ 


and  the  value   of  the  normalization  coefficient  in  the  asymptotic 
solution  for  the  central  field  of  force  is 


(13.79) 

An  energy  level  diagram  for  the  hydrogen  atom,  showing  both 
discrete  levels  and  the  continuous  spectrum,  is  given  in  Fig.  13.3. 
This  diagram  clearly  shows  the  degeneracy  with  respect  to  /, 
which  results  in  a  merging  of  all  levels  with  the  same  n  into  a 
single  level. 

Besides  the  ordinary  transitions  between  discrete  levels,  two 
other  processes  can  take  place— ionization  and  capture.  Basically, 
each  of  these  processes  is  the  reverse  of  the  other.  In  ionization,  an 
electron  jumps  from  a  discrete  level  (E<0),  such  as,  for  example, 
the  ground  state,  to  the  region  of  positive  energies  (£>>0)  which 
forms  a  continuous  spectrum  (hyperbolic  orbits).  This  process 
involves  the  absorption  of  energy.  Conversely,  in  the  process  of 
electron  capture,  a  free  electron  jumps  into  one  of  the  possible 
discrete  levels,  at  the  same  time  liberating  a  corresponding  amount 
of  energy. 

A  certain  amount  of  energy  is  required  to  transfer  an  electron 
from  the  ground  state  (n=\)  to  the  region  £>0.  This  energy  is 
given  by  (see  Fig.  13.3) 


where   T  =  ~-   is   the  kinetic  energy  of  an  electron  which  is  no 

longer  bound  to  the  nucleus.  The  energy  £lon  represents  the  ioniza- 
tion energy  of  the  atom.  It  is  at  its  minimum  when  T  — 0;  this  cor- 
responds to  the  minimum  energy  (E  =  0)  transition  of  an  electron 
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from  the  level  n  =  l  to  the  continuous  spectrum.  As  a  result  of 
this  transition,  the  electron  can  leave  the  atom.  For  the  hydrogen 
atom 


-=  13.59  ev. 


E.  CALCULATION  OF  THE  EFFECTS  OF  THE 
MOTION  OF  THE  NUCLEUS 

In  developing  the  theory  of  hydrogen- like  atom;  we  have  until 
now  ignored  the  motion  of  the  nucleus.  Accordingly,  our  theory  is 
rigorous  only  for  the  case  of  infinitely  large  nuclear  mass. 
In  general,  this  is  a  relatively  rough  approximation,  particularly 
in  the  case  of  light  elements  such  as  hydrogen  and  helium.  By 
taking  into  account  the  motion  of  the  nucleus,  it  is  possible  to 
explain  a  number  of  important  experimental  facts. 

We  shall  allow  for  the  effects  of  the  nuclear  mass  M  in  the 
same  way  as  we  did  in  our  discussion  of  the  spectra  of  diatomic 
molecules  (Chapter  12),  replacing  everywhere  the  electron  mass 
m0  by  the  reduced  mass 


M 
The  Rydberg  constant  then  becomes 


(13.80) 


As   a  consequence   of  this   change,  the  term  values  are  slightly 
shifted: 

(/!/)= -jjj^l   —  '-^j.  (13.82) 

The  radiation  frequency  is  therefore  given  by  the  relationship 

(13.83) 
which  differs  from  the  previous  one  [see  Eq.  (13.52)]  by  the  factor 


Since  the  frequency  of  the  radiation  depends  on  the  nuclear  mass 
M,  atomic  weights  can  also  be  determined  by  spectroscopic  methods, 
as  well  as  by  conventional  chemical  methods.  One  successful 
outcome  of  the  application  of  the  spectroscopic  method  was  the 
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proof  of  the  existence  of  heavy  hydrogen  and  ionized  hellium.  Pre- 
viously, the  average  atomic  weight  of  hydrogen  (relative  to  oxygen) 
was  found  by  chemical  means.  The  mass  spectrograph  made  it 
possible  to  measure  the  atomic  weight  of  each  atom.  These 
measurements  gave  somewhat  different  values  for  the  atomic  mass: 


Me: 


-  Af  tn  sp 
^chem 


MOO  %  ^0.0 145% 


(13.84) 


Hydrogen  (H*) 


This  led  Birge  and  Menzel  to  predict  the  existence  of  a  hydrogen 
isotope,  called  deuterium  or  heavy  hydrogen,  with  anatomic  weight 
twice  that  of  hydrogen  (D=tH*).  The  presence  of  deuterium  in 
natural  hydrogen  explains  the  greater  atomic  weight  obtained  in 
chemical  measurements.  The  mass  spectrograph  measures  the 
atomic  weight  of  tHl  along,  wince  the  spectral  lines  of  iH*  atoms 
fall  at  a  different  place  on  the  scale. 
Deuterium,  like  hydrogen,  can 
enter  into  reactions  forming,  for 
example,  heavy  water  D*O,  discov- 
ered by  Urey  and  Osborn  in  1932. 
Deuterium  is  usually  obtained  via 
the  electrolytic  decomposition  of 
water.  The  rate  of  evolution  of  or- 
dinary hydrogen,  at  the  cathode, 
greatly  exceeds  the  rate  of  evolu- 
tion of  deuterium;  thus,  the  concen- 
tration of  deuterium  in  the  residual 
electrolyte  increases.  (It  is  almost 
impossible  to  detect  deuterium  in 
natural  water  because  of  its  low  concentration.)  The  presence  of 
deuterium  was  confirmed  by  spectroscopic  studies  which  showed 
that  not  only  does  the  Balmer  series  (n'  =  2)  consist  of  the  lines 


1 

Deuterium 

I 

Tritium  ( 

\ 

Fig.  13.5.     Diagram    of    the    relative 

position     of    the     spectral    lines     of 

hydrogen  and  its  isotopes. 


Balmer 


1 


(13.85) 


but  that  each  of  these  lines  is  associated  with  a  second  line  situ- 
ated somewhat  to  the  right.  This  second  series  of  lines  (see  Fig. 
13.5)  is  described  by  the  equation6 


Balmer 


' 


(13.86) 


According  to  the  latest  experimental  data, 

Rm  =  2nc  •  109737,          KH  =  27rf  •  109678,         RQ  =  2frc  •  109707, 
where  the  numbers  represent  the  values  of  R9p  (see  footnote,  page  20). 
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which  can  be  readily  obtained  from  (13.83)  by  setting  the  mass  M 
equal  to  twice  the  mass  of  the  hydrogen  nucleus  and  substituting 
Z  =  1  It  is  worth  noting  that  the  large  relative  difference  between 
the  masses  of  the  deuterium  and  hydrogen  atoms  causes  far 
greater  differences  in  their  physical  and  chemical  properties  than 
is  usual  with  isotopes  of  other  elements.  Thus,  although  heavy 
water  outwardly  resembles  ordinary  water,  its  melting  and  boiling 
points  are  3.81  °C  and  101.4°C,  respectively,  its  viscosity  is 
greater  and  it  is  a  poorer  solvent  for  salts.  With  the  development 
of  nuclear  physics,  heavy  water  has  become  particularly  import- 
ant because  it  is  a  good  moderator  for  fast  neutrons,  and  can  also 
be  used  as  a  source  of  deuterium. 

At  present,  we  know  of  a  third  hydrogen  isotope,  namely  tritrium 
(T  —  jH-*),  whose  nucleus  consists  of  two  neutrons  and  one  proton.  It 
forms  a  compound  with  oxygen  similar  to  water.  The  ratio  of 
tritium  atoms  to  iH1  atoms  in  natural  water  is  approximately  10~18f 
whereas  the  ratio  of  iD'2  atoms  to  ill1  atoms  is  1/6800.  In  a  mixture 
with  deuterium,  tritium  is  a  very  important  substance  for  the  pro- 
duction of  thermonuclear  reactions:  the  reaction  between  ,D*  and 
iTJ  nuclei  leads  to  the  formation  of  2He4  and  one  neutron.  Each 
such  reaction  releases  more  than  17  Mev  of  energy.  Tritium  is 
also  a  beta  emitter  (with  a  half- life  of  12  years)  and  consequently 
is  widely  used  as  a  radioactive  indicator  in  chemical  and  biological 
investigations.  The  positions  of  spectral  lines  of  tritium  are  slightly 
displaced  relative  to  the  hydrogen  and  deuterium  lines  (see  Fig. 
13.5)  and  are  given  by  the  equation 

"aimer         "    '•  M/l          n  (13.87) 


Another  very  important  consequence  of  accounting  for  the 
motion  of  the  nucleus  was  the  discovery  of  ionized  helium,  first 
detected  in  spectroscopic  studies  of  the  sun.  The  solar  spectrum 
was  found  to  contain  a  series  of  lines,  with  positions  described  by 
the  equation 


(13.88) 


Energy  is  released  In  the  fusion  of  deuterium  and  tritium  into  helium,  just  as  in  the 
fission  of  U23S  or  Pu239  under  neutron  excitation.  Fusion,  however,  is  possible  only 
if  the  potential  barrier  of  the  Coulomb  repulsion  between  the  D  and  T  nuclei  is  overcome, 
High  temperatures  (~  108  degrees)  are  therefore  required  if  this  reaction  is  to  be  self-sus- 
taining, whereas  in  the  case  of  fission  even  low-energy  neutrons  can  easily  penetrate  the 
nucleus,  and  that  even  at  low  temperatures.  To  obtain  a  thermonuclear  reaction,  a  mixture 
of  deuterium  and  tritium  must  first  be  heated  to  a  temperature  of  tens  of  millions  of  degrees. 
Such  temperatures  may  be  created  in  an  atomic  explosion. 
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where  n^  takes  the  values 

"i  =  f-    3'  T-  4'  I'""  <13'89) 

This  series  is,  in  effect,  the  Balmer  hydrogen  series  (/n  =  3,  4,  5,...) 
with  a  number  of  intermediate  lines,  which  form  the  so-called 
Pickering  series,  characterized  by  the  half-integral  quantum 
numbers  n^=  5/2,  7,2,  9/2,  ....  At  first,  the  Pickering  series 
was  explained  by  assuming  that  hydrogen  was  in  a  special  state 
in  the  Sun,  so  that  the  quantum  number  n  could  assume  half- 
integral  values.  The  spectral  lines,  however,  were  later  found  to 
be  located  further  to  the  right  than  is  indicated  by  Eq.  (13.89), 
and  consequently  this  assumption  had  to  be  abandoned.  The  second 
hypothesis  assumed  that  the  observed  spectrum  arises  from  singly 
ionized  helium  (,  He4)4  ,  whose  nuclear  mass  is  Al  —  7360/?20  and 
whose  charge  is  Z  —  2.  According  to  (13.83),  its  radiation  fre- 
quencies then  are 


(13.90) 
Setting  ;i'  =  4,  we  reduce  (13.90)  to  the  form 


where  /i  =  5,  6,  7,  8,  .  .  .  . 

To  answer  the  question  of  whether  the  Pickering  series  was 
due  to  the  radiation  of  hydrogen  atoms  (under  the  assumption  that 
the  quantum  numbers  may  assume  half-integral  values)  or  to  the 
radiation  of  ionized  helium  atoms  (with  the  usual  integral  values  of 
the  quantum  numbers),  it  was  necessary  to  find  an  experimental 
value  of  the  Rydberg  constant.  In  the  case  of  hydrogen, 


(13.92) 

whereas  for  helium 

(13.93) 


Careful  spectroscopic  studies  confirmed  that  the  Rydberg  constant 
has  the  value  (13.93),  and  thus  it  was  shown  that  the  Pickering 
series  represents  the  spectrum  of  ionized  helium. 
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Problem  13.J^  Starting  with  Eqs.  (13.26)  and  (13.28),  show  that  the  radial  wave  func- 
tl on*S "Wn/Mr TflSTSrincipal  quantum  numbers  n  =  I,  2  and  3  are  as  follows: 


'"  (6 -6p+P«), 


\§1  — — >  ,j^r  1 

9  V  6 


where 


_2Zr 


On   the   basis    of  these  specific  examples,  show  that  the  functions  Rnl  are  orthonormal, 
that  tsf 


Hint.    In  proving  orthonormality,  make  use  of  the  fact  that  at  different  n  and  n\  the 
corresponding  values  for  p  also  differ  from  one  another. 

r——*^ 

LProblem  13.ZJ  Show  that  the  average  electrostatic  potential  produced  by  a  hydrogen-* 
ground  state  is 


+  iy-    («,=-»!,). 

aa~  rj  \          m0^/ 


Hint.    Find   the   average  electrostatic  potential  produced  by  the  electron  <J>  j    (r) 


—  ?o  l 
J 


(r1)  . 


.  —  _    ,  . 
I  *       •    I 

over  the  solid  angle  Q'>  use  the  identity 


^  (r')  ^3.v'  and   add   it   to   the   potential  of  the  nucleus.  Integrating 


T  Snow  that  at  ^  =  If  ^e  maximum  of  the  probability  density  distri- 
bution"  u  —  r*R*  in  the  states  Is,  2/?  and  3d  occurs  at  distances  of  a0,  4o0  and  q</rt  from 
the  nucleus  (circular  orbits).  Why  do  the  2s,  3s  and  fy  states  have  several  maxima 
(see  Fig.  13.6)7 


THE  THEORY  OF  THE  HYDROGEN- LI  KE  ATOM 

3s 


229 
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Fig.  13.6.  Graph  of  the  radial  probability  density  distribution  D  =  r*R* 
for  various  states. 

problem  13.4J  Using  the  functions  given  in  Problem  13.1,  verify  the  following  equa- 
lons  !T>r'""tt=i= T,~~i  and  3: 


and 


Find  the  spread  (Ar)3  =  r*  —  r8  of  the  radial  deviations  for  these  states.        _________ 

On  the  basis  of  Fig.  13.6  and  the  uncertainty  principle,  explain  why  (Ar)adoes  not 
/anish  for  circular  orbits  in  the  quantum  theory. 

Cgj-phignq  ly^Ry  means  of  the  quasi-classical  WKB  method,  find  the  discrete  energy 
spectrum  of  the  hydrogen-like  atom. 

Solution.  According  to  (5.75)  and  (13.62),  the  eigenvalues  of  the  discrete  spectrum 
[E  <  0)  may  be  found  from  the  equation 


where  r\  and  r»  are  roots  of  the  integrand  and 


Since  the  value  of  this  integral  Is8 


ve  obtain  exactly  the  same  expression  for  the  energy  of  the  hydrogen-like  atom  as  via 
he  Schrodinger  theory. 


8This    integral   can    be   easily   evaluated   from    (13  64)   by    setting  a  ~  —  A,    b  -  2B     and 
-  __(/  4.  1/2)2,  and  regarding  rl  and  r2  *>  rj  as  roots  of  the  equation  /"(r)  -  0    The  logarithm 
»f  the  complex  quantity  [assuming  that  a  tends  to  zero  from  the  direction  of  positive  num- 
>ers  (a  =  \  0)1  may  be  taken  as: 

?0     /3  ^  0  , 

in    fl  <Q  t 


2Tt/3>0  , 
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Problem  13.6J  Determine  that  magnetic  field  intensity  at  the  center  of  the  hydrogen 
aTUlii  MMUi  li>  Ttue  to  the  orbital  motion  of  an  electron.  Find  its  numerical  value  for  the 
1p  state.  

Answer.  Hx  —  H  =  Q, 


//,=-/ 

For  the  2p  state  (m  =  1) 


gauss. 


Hint,     lake    the    classical    expression    //  =  --  ?-—  r  x  p  for  the  intensity  of  the 

magnetiv  field  produced  by  a  moving  charge.  Transform  to  a  quantum  treatment  of  the 
problem  by  replacing  r  x  p  by  the  angular  momentum  operator  L.  Then,  using  Eq. 
(13.29a),  calculate  the  average  value  of//. 


Chapter  14 

Time-Independent  Perturbation  Theory 


(A    BASIC  PRINCIPLES  OF  THE  PERTURBATION 
^  TREATMENT  OF  PROBLEMS 

A  relatively  large  number  of  problems  in  quantum  mechanics 
cannot  be  solved  exactly  with  present-day  mathematical  methods. 
Thus,  various  approximate  calculations  have  to  be  used.  One  of 
the  most  widely  used  approximations  is  the  perturbation  theory 
method,  which  was  first  developed  to  handle  problems  in  celestial 
mechanics.  It  is  well  known  that  in  Newtonian  mechanics  it  is  pos- 
sible to  solve  exactly  only  the  two-body  problem  (for  example,  the 
Earth-Sun  or  the  Moon-Earth  problems).  We  cannot,  however,  ne- 
glect interplanetary  forces  and  consider  only  the  attraction  of  the 
planets  by  the  Sun  since  many  delicate  phenomena  are  associated 
with  these  additional  interactions  (it  is  worth  recalling  in  this  con- 
nection that  Lever rier  predicted  the  existence  of  Neptune  on  the 
basis  of  the  orbital  deviations  of  Jupiter,  after  which  the  planet  was 
discovered  by  astronomers).  It  thus  became  necessary  to  consider 
the  many-body  problem,  which  has  no  exact  solution  in  classical 
mechanics.  In  celestial  mechanics  it  was  found  that  the  perturba- 
tion problems  could  be  handled  by  means  of  an  approximation  based 
on  the  fact  that  the  forces  between  the  planets  are  much  smaller 
than  the  force  of  attraction  to  the  Sun.  In  this  method,  one  starts 
by  solving  the  two-body  problem  (the  zero-order  approximation), 
then  takes  the  "perturbation"  into  account  and  finds  the  correction 
to  the  solution  (the  first-order  approximation).  In  other  words,  the 
4 'perturbation  method"  involves  taking  the  principal  forces  acting 
on  a  body,  finding  the  rigorous  solution  for  these  forces,  and  then 
taking  the  "perturbing"  forces  into  account. 

Similarly,  in  the  quantum-mechanical  treatment  of  the  motion 
of  several  electrons  in  an  atom,  it  is  necessary  to  consider  first 
the  principal  forces,  such  as,  for  example,  the  force  between  the 
nucleus  and  an  electron.  In  this  case,  the  perturbing  forces  may 
be  taken  to  be  the  Coulomb  forces  of  mutual  repulsion  between  the 
electrons.  In  the  problem  of  an  atom  subjected  to  an  external 
electric  or  magnetic  field  whose  strength  is  small  relative  to  the 
electric  field  of  the  nucleus,  the  perturbation  may  be  taken  to  be 
the  energy  of  the  electron  in  the  external  field. 
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£}  FUNDAMENTAL  EQUATIONS  OF  PERTURBATION 

THEORY 

We  shall  now  develop  the  perturbation  theory  in  a  form  suitable 
for  stationary  problems,  that  is,  problems  in  which  the  Hamiltonian 
of  the  system  does  not  depend  on  time.  Suppose  the  Hamiltonian 
has  the  form 

H  =  T  +  l/  =  T  +  l/°  +  I/',  (14.1) 

where  the  perturbation  energy  V  <  K°,  and  the  main  part  of  the 
potential  energy  l/°  is  chosen  in  such  a  manner  that  the  Schrbdinger 
equation  of  the  system 

(E  —  H)<p  =  0  (14.2) 


has  an  exact  solution,  characterized  by  £°and  <j>°,  when  the  pertur- 
bation V  is  neglected.  Setting  T  +  V*  =  H°  (the  zero-order  approx- 
imation) and  using  Eq.  (14.1),  we  can  write  (14.2)  as 

(£_H°  —  1/'H  =  0-  (14.2a) 

The  basic  problem  in  perturbation  theory  is  to  calculate  from  this 
equation  the  energy  values  E  n  and  the  corresponding  eigenfunctions 
<|>n.  The  solutions  are  sought  in  the  form  of  the  series 


E"-|-...f  (14.4) 

where  <]/  and  E'  are  terms  of  the  first  order  of  smallness  relative 
to  <!>°;  £°,  <]/'  and  E"  are  terms  of  the  second  order  of  smallness; 
and  so  on. 

As  a  rule,  the  perturbation  energy  V  can  be  represented  in  the 
form  of  a  potential  energy  of  the  same  order  as  VQ,  multiplied  by 
some  small  parameter  X(X<1).  The  solutions  (14.3)  and  (14.4) 
should  then  appear  as  expansions  in  terms  of  X.  Thus,  £°  and  <i>° 
will  be  independent  of  this  parameter,  E'  and  f  will  be  propor- 
tional to  X,  E"  and  <]/'  to  X2,  and  so  on.  In  the  expressions  for  <|*  and 
E,  let  us  restrict  ourselves  to  terms  of  the  first  order  of  small- 
ness  (that  is,  we  shall  retain  only  terms  which  are  independent  of 
X  or  directly  proportional  to  X).  Substituting  (14.3)  and  (14.4)  into 
(14.  2a),  we  obtain  the  following  equation  for  <j/  and  £': 

(£°  +  £'  —  Hft  —  V)  (f  +  f  )  =  0.  (14.  2b) 

f 

Collecting  terms  of  the  same  order,  we  have 

(£•  -  H»)  f  -f  [(£'  -  V")  f  +  (£•  -  H«)  f  1  +  (£'-  V)  f  =0.  (14.2c) 
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Equation  (14.2c)  may  be  regarded  as  an  exact  equation,  since  we 
have  not  neglected  any  terms  in  it,  and  since  <|/  and  £'  may  be  taken 
to  represent  the  sums  of  all  terms  of  different  orders  of  smallness 
(that  is  .  f  ^  f  -f-  <|/'  -f  ...,£'->£'  +  £''  +  ...).  To  obtain  the  first- 
order  approximation  of  perturbation  theory,  we  neglect  the  second- 
order  term  (E'—V)  <]/  in  (14.2c)  and  use  the  equation  for  the  zero- 
order  approximation 

(£°  —  H°)f  =  0.  (14.5) 

This  equation  yields  the  zero-order  eigenvalues 

£0      170      p»  po 

1>     ^it     *"3»     •••»     "tit     •••} 

and  eigenfunctions 


which  are  connected  by  the  relationship 

(£);>-  H°H;V  =  0.  (14.6) 

Keeping  all  this  in  mind,  we  shall  now  investigate  the  equation  for 
the  first-order  approximation  of  perturbation  theory 

(£<»  -  H°)  f  =  -  (£'  -  V)  <!/>.  (14.7) 

We  assume  that  at  the  beginning,  the  system  is  in  the  quantum 
state  n'  =  n  Since  £°  —  £,;  and  «|/'  =  W  in  the  zero-order  approxima- 
tion, and  £'==£„  and<j*'  =  'j>«  in  the  first-order  approximation,  Eq. 
(14.7)  becomes 

(£S  -  Hn)  *;  =  -(£;-  V)  ft.  (14.7a) 

We  recall  that  an  arbitrary  function  can  be  expanded  in  a  series 
of  orthonormal  functions  forming  a  complete  set  and  satisfying  the 
same  boundary  conditions  as  the  original  function;  therefore,  we 
may  assume  the  solution  for  ^  to  be  of  the  form 

n,^  (14.8) 

Our  problem,  therefore,  reduces  to  determination  of  the  unknown 
coefficients  Cn>  of  a  generalized  Fourier  series.  Substituting  (14.8) 
into  (14.7a),  we  have 
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or,  taking  into  account  (14.6),  we  find 

n>  (E9n  -  £M  ft*  =  -&-  V)  «.  (14.9a) 


-  £,       a    NONDEGENERATE  CASE 

Let  us  suppose  that  the  system  is  nondegenerate.  Therefore, 
each  energy  eigenvalue  En  corresponds  to  one  and  only  one  eigen- 
function  <[»;.  Then,  multiplying  Eq.  (14.9a)  on  the  left  by  <J»J*  and 
integrating  over  all  space,  we  have 

n>  (Enn  -  EM  *nn>  =  -  E'H  +        *1*  V^S  #*•  (14.10) 


In  obtaining  this  equation,  we  have  made  use  of  the  orthonormality 
of  the  eigenfunctions  <|>JJ: 


Since  the  left-hand  side  of  Eq.  (14.10)  is  equal  to  zero  (£J  —  £J.  =  0 
for  rt'==tt,  and  &/m'  —  0  for  nf  -£  n),  the  energy  correction  E'n  is 

^==V;m,  (14.11) 

where 


Thus,  the  energy  correction  Ei  of  the  system  quite  naturally  turns 
out  to  be  equal  to  the  average  value  of  the  perturbation  energy  V. 
It  is  worth  noting  that  the  expression  (14.11)  for  the  energy 
correction  E'n  was  obtained  by  setting  the  left-hand  side  of  Eq. 
[14.  7a)  equal  to  zero  after  it  had  been  multiplied  by  <[»J|*  and  in- 
tegrated over  all  space.  Since  y*  is  a  solution  of  the  homogeneous 
aquation  (14.6),  it  follows  that  the  right-hand  side  of  the  inhomo- 
geneous  equation 

M't—  /  (14.12) 

is   orthogonal  to  the  solution  of  the  corresponding  homogeneous 
equation  Mf  =  0,  that  is, 

°*/d3*  =  0.  (14.13) 
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Let  us  now  proceed  to  determine  the  wave  functions  (that  is, 
the  coefficients  Cn>)  of  the  Schrodinger  equation  (14.7a)  in  first- 
order  perturbation  theory.  We  write  Eq.  (14. 9a)  as 


Multiplying  on  the   left   by  ^y(n'^n)9   using  the   orthonormality 
condition,  and  integrating  over  all  space,  we  obtain 


where 

V;*=Jt?y'»ld"*.  (14.15) 
Thus  ,  for  ty'n   we  have 

T-i  =  CnW  +  2'  C"*'  (14.  16) 


where  the  prime  on  the  summation  symbol  indicates  that  the  sum 
is  taken  over  all  n'  except  n'  =  n.  Finally,  the  as  yet  undetermined 
coefficient  Cn  of  the  zero-order  wave  function  can  be  found  from 
the  normalization  condition 


^=l  (14.17) 

for  the  total  wave  function 

t«  =  W  +  #  =  Ci  W  +  2'  C""W'>  (14.  18) 

n' 

where 

«=!+<;„.  (14.19) 

-    Substituting   (14.18)  into  (14.17)  and  keeping  only  terms  up  to 
the  first  order,  we  have 


'JCS*^  J  «**!'  <P*  +  CJ'CJ  J  tt»ft  d'x}  =  1  .         (14.20) 
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Making  use  of  the  orthonormality  condition  and  ignoring  the  phase 
factor,  which  is  of  no  interest  to  us,  we  obtain 

CS=1.  (14.21) 

and  therefore  Cn  =  Q» 

Consequently,    for    the    wave    function   <]»„   in  the   first-order 
perturbation  theory,  we  finally  obtain 


-f    '  —E^ 


From  Eqs.  (14.22)  and  (14.11),  it  can  be  seen  that  both  <K  and 
E'n  are  proportional  to  the  first  power  of  the  perturbation  energy 
(that  is,  to  the  parameter  A).  If  we  were  to  compute  the  corrections 
to  the  energy  and  the  wave  function  in  second-order  perturbation 
theory,  both  E"n  and  <]>J  would  turn  out  to  be  proportional  to  the 
second  power  of  V"(that  is,  to  X2 ). 


D.     DEGENERATE   CASE 

We  shall  now  develop  the  perturbation  theory  in  a  form  applica- 
ble to  degenerate  systems,  in  which,  in  the  absence  of  any  per- 
turbation, a  given  energy  eigenvalue  EQn  has  associated  with  it  ; 
eigenfunctions  iJ-.'^  *^-  (>-.^  ^  •**'(*  (*  \«*>\ 


It  is  obvious  that  any  linear  combination  of  these  functions 

/ 


(14'23) 


is  itself  a  solution  of  the  wave  equation  in  the  zero-order  approxi- 
mation 


If  there  is  a  perturbation  V,  this  arbitrariness  is  removed  and 
the  coefficients  Q  may  become  connected  with  one  another  by 
certain  definite  relationships.  Let  us  show  this,  proving  first  that, 
just  as  in  the  case  of  nondegenerate  states,  any  particular  solution 
of  the  homogeneous  equation  (14.6)  is  orthogonal  to  the  right- 
hand  side  of  the  inhomogeneous  equation  (14. 7a)  of  first-order 
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perturbation  theory*    For  this  purpose,  we  multiply  (14. 7a)  on  the 
left  by  tyn*  and  integrate  over  all  space.    We  then  obtain 

f  W  (En  —  H°) <!>'  d'x  =  —  f  <|»J*  (En  —  V)  K  d*x.  (14.24) 

«)      *  J      * 

Using  the  theorem  for  the  transfer  of  a  derivative  [see  Eq.  (7.17)], 
we  have 

j  ¥  (Ei  -  H")  W;  d'jc  =  -  J  T  JJ  (£;  -  F) «  #x.  (14.25) 

Since  ^Ji*    is  a  solution  of  the  Schrodinger  equation  (En  —  H")'^!c  =  0, 
we  arrive  at  the  equation 

,,  ^ 

\  ^*  (£;  —  V)  2]  C!^S|f  rf»jc  =  0  (14.26) 

Without  any  restriction  on  the  generality,  we  may  assume  that  all 
the  eigenf unctions   <1>H,,  are  orthonormal.1   Then,  since 


we  obtain   instead  of  (14.26) 

^' 
Q (El ~V',,)=2i  C*>' Vli'>  (14-2?) 

iSi 

where 

y'ymi<Pxt  (14.28) 

;v'wx*,  (14.29) 

and  the  prime  on  the  summation  sign  indicates  that  the  sum  extends 
over  all  /"s,  except  i'  =  im  Since  the  subscript  /  in  (14.27)  can  take 
any  value  from  1  to  /,  we  have  a  system  of  j  homogeneous  equations 
from  which  we  can  determine  the  energy  E'n  and  the  coefficients  C] : 


(14.30) 


-  c;  v ,;  -  c;i/;2  - . . . + c;  (£;  -^  y ;,) = o. 


If  the  functions    i/;fl  ,  are  not  orthonormal,  it  is  always  possible  to  construct  from  them 
by  means  of  linear  transformations,  new  functions  which  possess  this  property. 
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Recalling  that  the  wave  function  <1»1  must  satisfy  the  normalization 
condition 

*tS  <***=!,  (14.17a) 

we  see  that  the  correction  En  to  the  energy  of  the  unperturbed 
state  En  of  the  system  is  uniquely  determined,  as  are  the  coeffic- 
ients Ci  (and  hence  alsot^).  Since,  in  particular,  the  system  of 
equations  (14.30)  can  have  a  nontrival  solution  only  if  its  determi- 
nant is  equal  to  zero,  we  have  the  following  equation  for  E'ni 


=  0,  (14.31) 


This   equation   is  called  the  secular  equation,  a  term  taken  from 
celestial  mechanics.  -  ^ 

If  the  secular  equation  has  several  roots  (not  necessarily  / 
different  roots),  each  one  of  them  will  correspond  to  a  completely 
determined  set  of  coefficients  CJ,  which  can  be  found  from  (14.17a) 
and  (14.30)  by  substituting  the  given  root  E'flk  for  E'n  .  Consequently, 
the  different  corrections  E'n  to  the  energy  lead  to  different  zero- 
order  wave  functions.  Thus,  if  a  system  is  /-fold  degenerate  in  the 
absence  of  any  perturbation,  a  possible  effect  of  a  perturbation  is 
to  reduce  or  completely  remove  the  degeneracy  [this  will  occur  in 
the  case  when  Eq*  (14.31)  has  /  different  roots]. 


E.    THE  STARK  EFFECT 

If  an  atom  is  placed  in  an  electric  field,  its  spectral  lines  are 
generally  split  into  components.  This  phenomenon  was  discovered 
by  Stark  in  1913.  Experiments  have  shown  that  the  effect  of  an 
electric  field  on  a  hydrogen  atom  is  different  than  that  on  other 
atoms.  In  a  weak  field,  the  splitting  of  the  energy  levels  of  hydro- 
gen (for  example,  the  Balmer  series)  is  proportional  to  the  first 
power  of  the  field  intensity  (the  linear  Stark  effect) ,  whereas  for 
all  other  atoms  the  splitting  is  proportional  to  the  second  power  of 
the  field  intensity  (the  quadratic  Stark  effect).  In  a  stronger  field 
(of  the  order  of  105  volts/cm),  there  is  an  additional  splitting  ef- 
fect, proportional  to  higher  powers  of  the  field  intensity  (second 
power  in  the  case  of  hydrogen).  In  very  strong  electric  fields,  the 
spectral  lines  disappear  completely. 

The  Stark  effect  could  not  be  explained  before  the  development 
of  quantum  mechanics.  In  terms  of  classical  concepts,  the  motion  of 
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an  electron  in  an  atom  can  always  be  resolved  into  three  mutually 
orthogonal  oscillations.  Let  us  consider  the  oscillation  along  the 
z  axis  ,  which  can  be  taken  to  coincide  with  the  direction  of  the 
constant  electric  field  E(EX  =  EV  =  0,  Ez  =  §)  The  equation  describ- 
ing the  motion  of  the  electron  along  the  z  axis2  is  (for  e  =  ~  e0) 

mQz  +  MQu>lz  =  —  £<>&,  (14.32) 

where  w0  is  the  electronic  mass  and  <»0  is  the  angular  frequency  of 
oscillations.  It  is  readily  seen  that  the  solution  of  Eq.  (14.32)  is 

z  =  -  ~£!r  +  A  cos  (u)°'  +  *>•  <14-33) 

It  is  clear  that  in  classical  theory  the  only  effect  of  the  constant 
force  (  —  e0g)  is  to  change  the  position  of  the  point  of  equilibrium 
of  the  system.  The  frequency  of  oscillations  remains  completely 
unaffected.  Consequently,  classical  concepts  imply  that  the  fre- 
quency of  the  radiation  emitted  by  an  atom  remains  the  same 
whether  or  not  the  atom  is  placed  in  an  electric  field,  since  the 
frequency  of  the  radiation  is  the  same  as  the  mechanical  frequency 
of  oscillations  of  the  atomic  electrons.  Accordingly,  an  electric 
field  cannot  produce  any  shift  of  the  spectral  lines  in  classical 
theory. 

We  shall  now  consider  the  Stark  effect  in  terms  of  quantum 
concepts.  As  we  have  just  mentioned,  there  are  two  basic  forms 
of  the  Stark  effect:  namely,  linear  and  nonlinear.  The  linear  effect 
is  characteristic  only  of  hydrogen-like  atoms  because  they  are 
degenerate  not  only  with  respect  to  the  magnetic  quantum  number 
tn,  but  also  with  respect  to  the  orbital  angular  momentum  quantum 
number  /  (see  Chapter  13),  with  which  the  linear  Stark  effect  is 
associated.  In  all  other  atoms,  there  is  no  degeneracy  with  respect 
to  /,  and  therefore  the  linear  Stark  effect  is  not  observed. 

Let  us  discuss  in  greater  detail  the  theory  of  the  linear  Stark 
effect  for  the  hydrogen  atom.  We  shall  confine  our  treatment  to 
the  second  quantum  level  (/i  —  2).  3  The  external  electric  field 
(which  is  of  the  order  of  10l  —  10*  volts/  cm  in  experiments)  is  much 
weaker  than  the  intraatomic  field  produced  by  the  nucleus,  which  is 


109  volts/cm, 

nucl       fljj 

where  a{}  is  the  radius  of  the  first  Bohr  orbit.  Therefore,  we  may 
use  the  results  of  perturbation  theory  (in  the  form  developed  for 


The  electric  field  intensity  will  have  no  effect  on  the  oscillations  along  the  x  and  y 
axes,  which  are  perpendicular  to  E. 

The  first  quantum  level  (n  -  1)  is  nondegenerate  and,  therefore,  is  not  split. 
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degenerate  problems).    For  the  perturbation   V",  we  must  take  the 
potential  energy  of  the  electron  in  the  external  electric  field 

V'  =  e0&z.  (14.34) 

In  the  unperturbed   state,   the  energy  of  the  electron  is  [see  Eq. 
(13.33)] 


which  has  four  wave  functions  associated  with  it: 


=  Ko.o  =  R»  (r)  Y:  =      =  Rn  (r),  (14.35) 


$3  =  Ki.o  ==  #21  (r)  Y\  =  y  /-  RH  (r)  cos  ft,  (14.36) 

!  =  <hu  =  #a  (0  >'l  =  "[/"^  ^?n  (0  ^F=  e'9>  (14.37) 

Rtl(r)™±e-'.        (14.38) 

When  0  and  9  are  replaced  by  Cartesian  coordinates,  these  wave 
functions  take  the  form: 

W=/iW,  (14.35a) 

f2=/2  (/•)*,  (14.36a) 

(I>1  =r  / ,  / r"\  —'-— ^t '-^  -  /1J.    Q7o\ 

YJ      /2V/     ^~    ,  ^i^t.o/a; 

<K  =  — /j  (0  — 7  ,^ »  (14.38a) 

where 

(14.39) 
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The  general  wave  function  of  an  electron  will  have  the  form 

4 


(14.40) 


Since   the   degree   of  degeneracy   is    four  (/  =  4),  we  have  the 

following    system    of  four  equations  for  determining  the  unknown 

coefficients   E\  and  the  corrections   C!  to  the  energy  E\  of  the  un- 
perturbed state: 


i  -  v;3)  -  avM  =  o, 
-  ci'i/;,  -  cs  v;t  -  civ*  +  Q  (/•;  -  \/;4)  -  o, 


where 


*2*!«rjc.  (14.42) 

When  we  integrate  over  the  volume,  the  matrix  elements 

t'n.  va'2,  v;3l  v«,  v;3t  v;4,  v;,,  \/2;,  v;lf  v;2,  v/;4,  v;t,  v;2  ,,n<i  v;3 

vanish,  since  for  each  of  them  the  integrand  must  be  an  odd  func- 
tion with  respect  to  at  least  one  of  the  coordinates  x,  y  and  z.  Only 
the  matrix  elements 

V'u     and      V'n  =  V'U9 
which  are  even  functions  of  all  three  coordinates,  differ  from  zero 

V»=Vrii  =  «.8  J/i  (Of.  (O^1^.  (14.43) 

Let  us  substitute  the  values  of  Eq.  (14.39)  for  f{  (r)  and  f2  (r)  and 
note  that,  according  to  Eq.  (13.28a), 


and 
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Then,    integrating   (14.43)  over  the  angles  $  and  ?  (remembering 
that  z  =  r  cos  0),  we  obtain 


(14.43a) 


Next,  taking  into  account  the  equality 

00 


we  obtain 


(14.44) 


Using  the  obtained  values  of  the  matrix  elements  Vi<t,  we  find, 
in  accordance  with  (14.31),  that  the  secular  equation  for  the  correc- 
tions E't  is 


.§00 
E\       0      0 

o    £•;     o 

0     0      E\ 


=  0, 


(14.45) 


which  can  be  rewritten  in  the  form 


This  equation  has  four  roots  : 


(14.45a) 


(14.46) 


each  one  of  which,  according  to  Eq.  (14.41),  is  associated  with  a 
quite  specific  set  of  coefficients: 


C2>  
J      


(14.47) 


Here  the  superscript  /  of  the  coefficients  O/1  =  C\  indicates  the  so- 
lution (or  root)  of  Eq.  (14.45a)  with  which  it  is  associated.  Thus,  it 
follows  from  (14,40)  and  (14.47)  that  the  energy  level 


(14.48) 
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is  associated  in  the  zero-order  approximation  with  the  wave  func- 
tion 


Because  of  the  normalization  condition 


this  wave  function  becomes 

t°(1)=y^(^.o  +  *i.M).  (14.49) 

In  a  similar  fashion,  it  can  be  shown  that  the  state  with  energy 

(14.50) 


is  described  by  the  zero-order  wave  function 

°(i) 


To  describe  the  states  with  energy 


which  remain  unperturbed  by  the  electric  field  in  the  first-order 
approximation,  we  may  equally  well  take  the  function 


or 


or,  alternatively,  we  may  take  a  linear  combination  of  these  func- 
tions, since  the  system  remains  degenerate  for  m  =  +  1  even  in  the 
presence  of  an  electric  field.  Thus,  when  the  z  component  of  the 
angular  momentum  is  not  equal  to  zero  (m  =  ±  1  in  units  of  #),  that 
is,  the  electron  is  moving  largely  in  the  xy  plane  (see  Chapter  12, 
Section  A),  there  will  be  no  splitting  of  the  energy  levels  (and, 
therefore,  of  spectral  lines)  in  an  electric  field.  If,  on  the  other 
hand,  the  orientation  of  the  angular  momentum  is  such  that  its  2 
component  is  equal  to  zero  m  =  Q  and,  consequently,  the  electron 


244  NONRELATIVISTIC    QUANTUM     MECHANICS 

is  moving  in  a  plane  which  includes  the  z  axis  (again,  see  Chapter 
12,  Section  A),  then  an  electric  field  does  lead  to  splitting  of  the 
spectral  lines  (see  Fig.  14.1). 


Rh 

'  4 


14.  I     Splitting  of  the  second  spectral  term  of  hydrogen  m 
an  electric  field  (the  linear  Stark  effect): 

a)  enorgy  level  in  the  absence  of  a  field  (£  ~-0), 

b)  energy  level  in  the  presence  of  a  field 


Qaulitatively,  the  Stark  effect  for  n  =  2  may  be  interpreted  as 
follows.  Since  the  wave  function  describing  the  motion  of  the 
electron  for  n  =  2  (see  Fig.  12.1)  is  not  centrally  symmetric,  the 
atom  has  a  certain  electric  moment  p.  Consequently,  when  the 
atom  is  placed  in  an  electric  field 


it  acquires  an  additional  energy 

V"  =  —  (p£)  =  —  pgcosTf  (14.52) 


where  7  is  the  angle  between  the  direction  of  the  electric  dipole 
moment  of  the  atom  and  the  z  axis.  Comparing  this  expression 
with  (14.46),  we  see  that  the  electric  dipole  moment  of  the  atom 
is  p  =  3al}eQ.  The  solution  ty"  u)  corresponds  to  the  case  7  =  0  and 
the  solution  <pola)to  the  case  f  =  rc.  For  the  third  and  fourth  solu- 

tions, it  is  necessary  to  set  f  =  :li^.  We  note  that,  in  the  last  case, 

the  electric  dipole  moment  is  oriented  perpendicularly  to  the  elec- 
tric field  and,  consequently,  no  additional  energy  appears.  To  con- 
clude this  discussion,  the  linear  Stark  effect  arises  because  of  the 
intrinsic  electric  dipole  moment  of  the  hydrogen  atom  at  n  =  2. 

Predictions  obtained  on  the  basis  of  quantum  mechanics  are  in 
good  agreement  with  experimental  data  only  in  the  case  of  weak 
fields  (10*  volts/cm).  At  higher  field  intensities  (^10  '  volts  /cm), 
there  is  an  additional  splitting  (the  quadratic  Stark  effect)  due  to 
the  removal  of  the  degeneracy  with  respect  to  the  magnetic  quan- 
tum number  m.  Finally,  at  field  intensities  greater  than  10*  volts/ 
cm,  the  Stark  effect  completely  disappears.  This  is  the  result  of 
autoionization  of  the  atoms,  that  is,  removal  of  electrons  from  the 
excited  levels. 
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F.    PRINCIPLES  OF  THE  CLASSICAL  THEORY  OF 
DISPERSION 

Perturbation  theory  has  many  important  applications  in  studies 
of  the  interaction  between  light  and  matter.  The  predictions  obtained 
in  this  way  differ  from  classical  results  and  receive  excellent 
confirmation  from  experiment.  This  section  is  concerned  with  the 
classical  theory  of  dispersion  (that  is,  the  scattering  of  light)  in 
a  dielectric  medium.  According  to  classical  notions,  a  dielectric 
is  characterized  by  the  index  of  refraction 


where  £  is  the  dielectric  permittivity  (the  magnetic  permeability 
(A  is  taken  to  be  equal  to  unity).  If  the  index  of  refraction  n  becomes 

larger   as   the   frequency  of  light  increases  (that  is,  ^>0),the 

dispersion  is  said  to  be  normal.  A  typical  example  of  normal 
dispersion  is  the  spectral  resolution  of  white  light  by  a  glass  or 
quartz  prism  (the  deflection  of  violet  rays  from  their  initial  direc- 

tion is  larger  than  that  of  red  rays).  If,  however,  ~  <0in  a  certain 

range  of  frequencies,  the  dispersion  in  this  region  is  said  to  be 
anomalous.  As  a  rule,  anomalous  dispersion  occurs  at  frequencies 
at  which  light  is  absorbed  by  the  medium. 

To  determine  the  index  of  refraction  (one  of  the  most  important 
problems  in  the  theory  of  dispersion),  we  use  the  equation  relating 
the  electric  field  intensity  E,  the  displacement  vector  Z>,  and  the 
polarization  vector  P4 

D  =  e£  =  E  +  4*P.  (14.53) 

Since  e  =  /^,  we  have 

P=^E.  (14.53a) 

Thus,  to  determine  n  we  are  required  to  find  the  relationship 
between  P  and  E  on  the  basis  of  the  microscopic  picture  of  the 
structure  of  matter.  We  shall  be  better  able  to  appreciate  the 
contribution  of  quantum  theory  in  this  connection  after  we  have 
completed  our  review  of  the  basic  principles  of  the  classical  theory 
of  dispersion. 

According  to  the  Lorentz  classical  theory,  atoms  may  be 
regarded  as  harmonic  oscillators  in  which,  in  the  simplest  case, 
all  the  electrons  oscillate  with  the  same  angular  frequency  u>.  If 


4The  polarization  P  is  defined  as  the  total  dipole  moment  of  the  atoms  per  unit  volume 
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we  take  the  z  axis  to  be  parallel  to  the  direction  of  propagation  of 
the  electromagnetic  wave,  then,  since  the  wave  is  transverse,  we 
may  direct  vector  E  along  the  x  axis  (£,  =  $,  £^  =  £,  =  0),  and 
vector  H  along  the  y  axis.  If  we  neglect  the  force  exerted  on  the 
atomic  electrons  by  the  magnetic  field  (since  the  magnitude  of  this 

force   is   only  a  fraction  (~\  of  the  force  exerted  by  the  electric 

field),  the  oscillation  of  the  electrons  can  be  described  by  the 
equation 

x  =  —  £o§-  (14.  54) 


We  shall  assume  that  the  frequency  of  the  incident  light  wave  is   u>. 
Then 


a  =  §0  cos  w  -  -*-  .  (14.55) 

If  the  energy  transported  by  the  wave  is  much  smaller  than  the 
bonding  energy  of  the  electrons  in  the  atom,  it  follows  that  the 

ratio  -  may   be  neglected  because   it   is   small  compared  to  the 

atomic  dimensions  (x~  a~  10~8  cm,  while  the  wavelength  A  is  of  the 
order  of  10~5  cm).  Thus,  under  the  assumptions  we  have  made,  the 
electric  field  of  the  wave  may  be  considered  to  be  quasi-  stationary 
inside  the  atom.  As  a  result  of  this  simplification,  Eq.  (14.54) 
becomes 


-=  —  ^  g0  cos  cot  .  (14.54a) 


Multiplying   (14.54a)  by  (—eQN),  where  N  is  the  number  of  atoms 
per  unit  volume,  and  substituting 

—  e^x  =  pxt  and     Px  =  Npx  =  &,      Py  =  Pz=:Ot 
we  reduce  this  equation  to  the  form 

ffi  -{-  «>;$*  =  —  So  cos  w/,  (14.54b) 

from  which  we  obtain 


Comparing  Eqs.  (14.56)  and  (14.53a),  we  obtain  an  equation  for  the 
index  of  refraction  which  should  be  familiar  from  optics: 


Wo 
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We  note  that  if  the  atom 
different  eigenfrequencies 


is  assumed  to  contain  electrons  with 


a  more  general  equation  is  obtained  instead  of  (14.57): 


/*»  — 1 


(14.57a) 


where  A^  is  the  number  of  electrons  per  unit  volume  oscillating 
with  the  frequency  vk. 

It  follows  from  (14.57)  that  at  radio  frequencies  (o><a>0)the 
index  of  refraction  may  be  assumed  to  be  constant,  without  signif- 
icantly affecting  the  accuracy.  Its  value  is  given  by  the  relation 


Ne* 


(14.58) 


On  the  other  hand,  for  frequencies  <o>o>0,  the  value  of  the  index  of 
refraction  is  given  by  the  relation 


Ne* 


471 


(14.59) 


The  index  of  refraction  is,  therefore,  a  constant  greater  than  unity 
for  o><<to0 ,  whereas  for  w^vu>0  it  is  less  than  unity,  approaching  it 
as  o>  —  CQ. 

At  frequencies  close  to  <DO,  the  magnitude  of  the  index  of  refrac- 
tion increases  without  limit,  and  at  o>  =  o>0  it  has  a  discontinuity 
(see  Fig.  14.2).  The  reason  for  this  be- 
havior of  the  function  is  that  Eq.  (14.54) 
does  not  include  the  radiation  damping  of 

the  electron  /-"damp—,  _r_  _!«,  #,  which  arises 

from   the   interaction   between  the  moving 

electron  and  its  own  field.  When  /^amp   is 

included,  the  dispersion  curve  has  the  form 

indicated   by   the  dotted  line  in  Fig.  14.2. 

Consequently,  the  dispersion  is  anomalous 

near  the  resonance  frequency  u>0 .  Since  the 

region   of  anomalous  dispersion  coincides      Persion  curve 

with  the  region  of  the  eigenfrequencies  of 

oscillation   of  electrons    in  the   atom,    it   follows  that  anomalous 

dispersion  is  accompanied  by  strong  absorption. 


° 


r 


Fig.  14.2.    Classical 


Ncg 
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F.     QUANTUM  THEORY  OF  DISPERSION 

Let  us  now  develop  a  quantum  theory  of  dispersion.  By  analogy 
with  the  classical  case,  we  shall  assume  that  all  the  electrons  in 
the  atoms  are  in  the  same  quantum  state  kf  We  shall  use  the 
perturbation  method  to  solve  our  problem,  since  the  energy  of  the 
interaction  with  the  external  field  is  generally  small  compared 
with  the  bonding  energy  of  the  electrons.  In  the  nonrelativistic  case 
(when  we  may  neglect  the  "magnetic"  force),  the  external  force 
which  acts  on  an  electron  can  be  obtained  from  Eq.  (14.55)6: 


The  perturbation  energy  is  given  by 

V'  =  e0xg0cosa>/.  (14.60) 

Consequently,  the  Schrodinger  equation  for  the  electron  is 


O.  (14.61) 

Let  us  suppose  that  Eq.  (14.61)  has  an  exact  solution  for  l/'  =  0: 

<pi(0  =  tfc-(l/n)V  ==^-H.^f  (14.62) 
where  <J$  and£*  satisfy  the  equation 

(E,-H°)<j*  =  0.  (14.63) 

In  accordance  with  perturbation  theory,  we  shall  look  for  a  solution 
in  the  form 


0-  (14.64) 

Since  (14.62)  and  (14.63)  yield 

(-7  'a--"' 


This  assumption  is  analogous  to  the  assumption  made  in  the  classical  treatment, 
according  to  which  all  the  electrons  have  the  same  eigenfrequencies  of  oscillation. 

Just  as  in  the  classical  case,  we  have  assumed  here  that  the  electric  field  is  quasi- 
stationary  over  distances  of  the  order  of  the  dimensions  of  the  atom. 
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the  equation  for  <]>i  (0  and  the  first-order  correction  to  the  energy 
Ekis 


Substituting  V  from  (14.60),  we  have 

^tfc"*"*)  +  ^ft(-*+")}'     (14-65a> 


To  eliminate  the  time  /  from  this  equation,  let  us  look  for  a  solu- 
tion (|»i  (/)  in  the  form 

^  (/)  =  ue-  u  <«,-«)  _(_  or-  «  («+«*>  .  (14.66) 

\Ve  then  have  the  following  equations  for  the  functions  u  and  v  : 

{H  (^  __  W)  _  H'}  u  ==  1  c«*8.«,  (14.67) 

{^  (o)A  +  01)  -  H()}  »  =  1  cojcfctl.  (14.68) 

We  note  that  these  two  equations  have  exactly  the  same  form. 
Consequently,  it  is  only  necessary  to  solve  Eq.  (14.67)  for  «,  since 
the  solution  for  v  of  Eq.  (14,68)  can  be  obtained  from  u  by  substi- 
tuting —  u>  for  w.  Since  the  time  does  not  appear  explicitly  in  Eq. 
(14.67),  we  can  find  u  by  the  perturbation  method  in  the  form  ap- 
plicable to  stationary  problems.  Thus  we  shall  look  for  a  solution 
in  the  form  of  an  expansion  in  eigenfunctions  of  the  unperturbed 
problem  [see  (14.8)]: 


**'  (14.69) 

where  the  <|4"  satisfy  the  equation 

(£^-H'')<p2''==0.  (14.70) 

Accordingly,  we  may  reduce  (14.57)  to  the  form 


*•  (14.67a) 

where  the  frequency  of  radiation  is 

^,.=^1^1.  (14.71) 
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Let   us  multiply  (14.67a)  on  the  left  by  <K*  and  integrate  over  all 
space,  taking  into  account  the  orthonormality  of  the  eigenf unctions 


We  then  obtain  the  following  equations  for  the 
coefficients  CV  and  the  function  u : 

— ,  (14.72) 

~  **'.  (14.73) 

where  the  matrix  element  xvk  is  equal  to 

:.  (14.74) 


Substituting  —  u>  for  u>    in  (14.73),  we  obtain  an  expression  for  the 
function  v: 


<14'75) 


From    (14.64),   (14.66),   (14.73)    and  (14.75),    it   follows   that   the 
total  wave  function  tyk  (t)  is 


—  —  I3-  ^  J*^!  N'*  cos  ^  —  ico  sin  orf]|  .  (14.76) 


From  the  wave  function  ^  (f  )  of  the  electron  in  the  external 
field,  we  can  readily  obtain  the  polarization  vector  of  the  medium 
<^.  In  the  classical  theory  we  had 


To  generalize  this  expression  to  the  quantum  case,  we  must  replace 
p  by  its  average  value.  Then 

.  (14.77) 

Substituting  <|>fc(f)  from  (14.76)  and  retaining  only  first-order  terms 
in  g0»  we  have 
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In  deriving  this  expression,  we  used  the  relation 


which  follows  from  the  fact  that  the  integrand  is  an  odd  function  of 
x.  Comparing  Eqs.  (14.78)  and  (14.53a),  we  obtain  the  dispersion 
formula 


(14.79) 


4  w 


By  introducing  the  new  variable 


(14,80) 


which  is  called  the  oscillator  force,  we  transform  Eq.  (14.79)  to 


Nel 


(14.81) 


Here  let  us  make  an  observation  similar  to  the  one  made  in  regard 
to  the  classical  treatment:  namely,  if  we  had  included  the  radiation 
damping  in  the  quantum-mechanical  treatment,  we  should  have 
obtained  a  finite  value  of  ri*  for  frequencies  o>  in  the  neighborhood 
of  uk'k  (see  Fig.  14.3af  the  dotted  line). 


/7*-f 


n*-1 


UJ 


a  b 

Fig.   14.3.  Dispersion  curves 

a)  positive  dispersion  C^/c"  ^/c'/c)' 

b)  negative  dispersion  (a>ft~&>fcJk') 

Equation  (14.81)  has  a  structure  similar  to  the  classical  equa- 
tion (14.57).  In  actual  fact,  however,  the  quantum  results  are 
fundamentally  different  from  the  classical  results.  From  quantum 
theory  it  follows  that  anomalous  dispersion  occurs  in  the  neighbor- 
hood of  frequencies  corresponding  to  allowed  transitions,  and  not, 
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as  in  classical  theory,  in  the  neighborhood  of  the  eigenfrequencies 
of  oscillation  of  the  electrons.  This  particular  conclusion  can  be 
seen  to  be  directly  related  to  the  role  of  the  oscillator  force  fvu 
in  (14.81),  which  is  specified  by  the  matrix  element  xvk  [see  (14.80)] 
which  characterizes  the  selection  rules  (and  thus  the  allowed 
transitions).  This  prediction  of  quantum  theory  was  experimentally 
verified  by  D.  S.  Rozhdestvenskiy. 

A  second,  very  important  difference  from  the  classical  results  is 
that  quantum  theory  leads  to  negative  dispersion  (see  Fig.  14.3b)—a 
phenomenon  which  has  no  classical  analog.  This  can  be  under- 
stood by  simply  noting  that  when  light  is  scattered  by  excited 
atoms,  it  is  necessary  to  take  into  account  the  states  with  Ek^>Ekf 
for  which 


For  these  states  the  dispersion  formula  (14.81)  becomes 

*=±  =  -  *i  V  _!4*L  (1 

47i     —         w0     7^  ufo  —  co*  '  ll 

k' 

and  the  dispersion  curve  is  represented  by  the  dotted  line  in  Fig. 
14.3b.  The  experimental  discovery  of  negative  dispersion  was  made 
by  Ladenburg;  thus,  this  prediction  of  quantum  theory  was  also 
confirmed. 

Let  us  now  find  the  value  of  the  oscillator  force  /v*  and  the 
dispersion  formula  for  a  harmonic  oscillator.  The  only  nonvanish- 
ing  matrix  elements  in  this  case  are  [see  (10.55)] 


,       _  -, 

and  **_,..-  / 


"By  chance,"  it  turns  out  that  the  quantum-mechanical  frequencies 
of  radiation  are  identical  with  the  eigenfrequencies  of  oscilla- 
tion 

^ii,  *  =  l°«  and  <°k-l,  *  =  —  wo. 

We  thus  obtain 

/*»!.*  =  (*+  1),    /*-!.*  =  -*.  (14082) 

Consequently  (sinceY  f*.k=  IK  the  dispersion  formula  (14,81)  can 

\         *'  / 

be  written  as 
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mQ    cog  — 


(14.83) 


We  can  see  that  in  this  particular  problem  the  quantum  and  clas- 
sical theories  yield  the  same  value  for  the  index  of  refraction  n. 
The  phenomenon  of  negative  dispersion  is  not  observed.  The 
reason  for  this  is  that  the  regions  of  positive  and  negative  disper- 
sion coincide  since  |«>*,i,ft|i=:=|««»*  i,  J",  so  that  the  stronger  effect  of 
positive  dispersion  masks  the  negative  dispersion. 


hw 


ficu' 


hw 


ID 


G.    RAMAN  EFFECT 

Let  us  consider  the  phenomenon  of  dispersion  from  the  stand- 
point of  energy  diagrams.  Suppose  that  a  photon  with  energy 

6  =  *(o  (14.84) 

impinges    on  an  atom  with  only  three  energy  levels  Ek"  <£*<  Ev 
(see  Fig.   14.4).    In  general,  the  scattering  of  this  photon  (that  is, 
dispersion)  will  be  a  second-order  effect.   The  first  form  which 
this  process  can  take  is  absorp- 
tion of  the  photon.  This  is  ac-  / 
companied   by  excitation  of  an 
electron  from  level  k  to  some 
intermediate  state  (which  may 
even  be  a  forbidden  state7;  see 
Fig.  14.4,1)  and,  subsequently, 
by  emission  of  a  photon.  If,  as 
a    result,  the  electron  returns 
to   its    initial   state,  it  follows 
from  the  law  of  conservation  of 
energy  that  the  frequency  <•>'  of 
the  scattered  photon  is  the  same 
as  the  frequency  «>  of  the  inci- 
dent photon. 

Alternatively,  the  order  of 
the  process  may  be  reversed: 
the  atom  first  emits  a  photon 
(see  Fig.  14.4,11)  and  then  ab- 
sorbs the  incident  photon.  As 

in  the  preceding  case,  the  frequency  a/  of  the  scattered  photon  will 
be  equal  to  the  frequency  o>  of  the  incident  photon  if  the  atom  re- 
turns to  its  initial  state. 


n 

Fig.     1 1.4.    Energy    diagram    for    photon 

scattering 

h(o  is  the  energy  of  the  incident  photon, 
tl«)  ,  the  energy  of  the  scattered  photon, 
I  and  II  represent  elastic  scattering 
(hu>£ha)kik  and  hw/-fla)kk"),  III  and  IV 
represent  induced  transitions  (hat^fia}^ k 


More   precisely,    the   law   of  conservation   of  energy   may   be   violated   in   intermediate 
states    It  is  required  to  hold  only  in  the  final  result 
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Finally,  resonance  occurs  when  w^wwv*.  In  this  case,  both 
processes—  scattering  and  absorption  of  the  photons—  take  place 
(see  Fig,  14.4,  III);  as  a  result  of  the  last  process  the  electrons  in 
the  atom  undergo  induced  transitions.  The  probability  of  these 

transitions  is  given  by  the  Einstein 
coefficient  Bk*  [see  (9.21)].  An 
external  field  increases  the  number 
of  downward  transitions  (see  Fig. 
14.4,  IV),  which  results  in  some 
additional  radiation  proportional  to 
the  coefficient  B**». 

So  far,  we  have  been  concerned 
with  cases  in  which  atoms  return  to 
their  initial  state  after  scattering. 
It  may  happen,  however,  that  after 
the  atom  has  absorbed  the  incident 
photon,  the  electron  does  not  return 
from  the  intermediate  state  to  the 
level  £,  but  instead  makes  a  transi- 

tion to  the  level  k'  or  k"  (see  Fig.  14.5).  In  this  case,  the  frequency 
of  the  scattered  light  («/  or  «/')  is  not  equal  to  the  frequency  of  the 
incident  light.  This  type  of  scattering  is  called  the  Raman  effect, 
after  the  Indian  physicist  who  first  discovered  this  phenomenon  in 
liquids.  In  solids  the  Raman  effect  was  discovered  by  the  Soviet 
physicists  L.  I.  MandePshtam  and  G.  S.  Landsberg  (1928). 

From  Fig.  14.5,  it  can  be  seen  that  the  frequency  of  the  scattered 
photon  may  be  either  lower  or  higher  than  the  frequency  of  the 
incident  photon.  In  the  former  case,  the  lines 


Fig.  14.5.  The  Raman  effect. 
hw  is  the  energy  of  the  incident 
photon,  ftea't  the  energy  of  the  scat- 
tered photon  corresponding  to  "Stokes" 
lines;  and  hu",  the  energy  of  the 
scattered  photon  corresponding  to 
"anti- Stokes"  lines 


U    =  (D  -  ( 


correspond  to  excitation  of  the  atom,  since  the  atom  ends  up  in  a 
higher  energy  state.  These  lines  are  known  as  "Stokes"  lines  (the 
levels  are  shifted  towards  the  red  part  of  the  spectrum).  The 
second  case  corresponds  to  "anti-Stokes"  lines  (shifting  towards 
the  violet  part  of  the  spectrum): 


u>"  =  0)  -{- 


co; 


these  lines  appear  only  when  the  light  is  scattered  by  excited  atoms. 
It  is  obvious  that  at  low  temperatures  only  Stokes  lines  can  be 
observed.  As  the  temperature  increases  and  the  atoms  of  the 
substance  begin  to  undergo  transitions  to  excited  states,  anti- 
Stokes  lines  appear. 

The    Raman  effect  provides   much  important   information   in 
studies  of  molecular  structure.    In  Chapter  12,  Section  C,  we  saw 
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that  the  rotational  and  vibrational  levels  (and  also  the  vibrational- 

rotational  levels),  which  provide  data  on  molecular  structure,  are 

all  located  in  the  far  infrared  region  of 

the   spectrum  and  are  very  difficult  to 

observe«  In  studies  of  the  Raman  effect,  ,  -  •  -  , 

it   is   possible   to  use   visible   light   in          a  j>  - 

determining    molecular   spectra,    since  w'  w" 

these    spectra   are   superposed   on  the 

lines    in  the   spectrum   of  the  incident 

light.  The  experimental  values  of  a/  and 

G>"     (See     Fig.    14.6)    yield    the    molecular 
frequencies 


\          I 


u>k,k  =  a)  —  (o'  and  a)**''  =  <o"  —  (0, 
/.  i  .    !      ,,  -,        .  .  ,  , 

from    which  the  selection  rules  can  be 
derived. 


Flg.  I4>6.  superposition  of  the 

molecular  frequencies  on  the 
frequency  of  incident  light: 
a)  spectral  line  a)  in  the  ab- 
sence  of  molecular  oscillations, 
and  6)  shift  of  the  spectral  line 
due  to  molecular  oscillations 


^Problem  14.1.J  Find  the  energy  correction  for  a  system  in  second-order  perturbation 
theory.         u*««w 

Solution.  Including  the  terms  up  to  and  including  the  second  order  in  the  expansions  of 
the  wave  function  '\>  (14.3)  and  the  energy  E  (14.4),  and  substituting  these  expansions  into 
the  Schrodmger  equation  (14.2a),  we  obtain  the  equation 

(/:  °  -  H°)  ^  =  -  (E'n  -  K')K  -  <K- 

Since  the  solution  <|£*  of  the  homogeneous  equation^  —  H°)  4*^=0  is  orthogonal  to  the 
right-hand  side  and  since  we  can  substitute  the  expression  (14.22)  for  ^f  we  have 


The  value  of  Vn,n  is  given  by  (  14.  15),  and  we  have  used  the  relationship 

v'  ,  =  v'*, 

v  nn  n  n* 

which  holds  for  Herrmn  an  operators. 

We  note  that  the  second-order  correction  (14.85)  to  the  energy  of  the  ground  state  is 
always   negative,   since   all   the   other  levels    En'  are  higher  than  the  level  £j,  that  is, 


Using   the   results  of  the  perturbation  theory,  find  the  energy  of  the 
ator  including  the  terms  up  to  the  order  of  fia;  take  the  Hamiltonian  of 


^  Erern__l4. 
arih"aTrn6nIc""o2ci 
the  system  to  be  equal  to 


where  V  =  ax*  +  P-^4  (&e  constants  a  and  P  are  classical  quantities),, 
Solution.  The  energy  of  a  harmonic  oscillator  (V  =  0)  is 

/£  =  *«(/! +>/.). 

Taking  V}  as  the  perturbation  energy,  the  first-order  approximation  gives 

En  =Vnn^*  (x 
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It  can  easily  be  shown  that 


«*U=Jl*. 


since  the  integrand  is  an  odd  function. 

To  calculate  the  matrix  elements  P  (x*)nn  »  we  use  ^e  multiplication  rule  for  matrix 
elements  (see  Problem  10.4),  obtaining 


Substituting   the  value   of  (x*)nk  from  Problem  10.5,  we  obtain  the  following  expression 
for  the  first  approximation  of  the  perturbation  energy  En". 


Our  problem,  however,  is  not  yet  fully  solved,  since  in  the  second-order  approximation 
there  is  a  contribution  proportional  to  —;-  ~  #a  which  arises  from  the  first  term  of 
the  perturbation  energy  a,v3,  and  we  must  take  this  contribution  into  account.  The 

A'14 

second-order  contribution  from  the  termjU'1  is  proportional  to-^"-~  H\  and  accordingly 

it  may  be  neglected  in  our  approximation.  The  second-order  correction  arising  from 
the  first  term  of  the  perturbation  energy  can  be  calculated  from  Eq.  (14.85): 

._    «"'      \(x*)nn'(x*)n'n 

~       "'    "' 


The  only  nonvanishing  matrix  elements  are  [see  Eq.  (10.55)  and  Problem  10.5] 


__ 

n  (n  —  1)  (n  —  2) 

^  -  ^       -  -; 


where  A*O  = 
Hence 


Part  II 
Relativistic  Quantum  Mechanics 


Chapter  15 

The  Klein-Gordon  Scalar  Relativistic 
Wave  Equation 


RELATIVISTIC  MECHANICS  AND  THE  KLEIN-GORDON 
EQUATION.  RELATIVISTIC  INVARIANCE 

The  Schrodinger  wave  equation  is  nonrelativistic:  it  is  suitable 
only  for  particles  whose  velocity  v  is  much  smaller  than  the 
velocity  of  light  c.  It^is  not  invariant  with  respect  to  the  Lorentz 
;ransformations  of  tlie  special  theory  of  relativity  since~there  IS 
in  asymmetry  between  the  time  and  space  coordinates^  (the/ 
equation  contains  a  first  derivative  with  respect  to  time,  and 
second  derivatives  with  respect  to  the  space  coordinates).  Ac- 
cording to  the  special  theory  of  relativity,  it  is  necessary  for  the 
time  and  space  coordinates  to  be  treated  on  the  same  basis. 

It  is  interesting  to  note  that  the  de  Broglie  relations 

p  =  Uk>    E  =  Uu  (15.1) 

are    relatively    invariant.     In  the   Lorentz  transformation,   they 
behave  like  a  four-vector  p^  with  components 


Pli  ,,,=/>,    ft=£.  (15.2) 

This  indicates  that  it  is  possible  to  generalize  quantum  mechanics 
to  the  case  of  particles  traveling  with  a  velocity  of  the  order  of 
the  velocity  of  light. 

A  method  of  extending  the  nonrelativistic  wave  equation  in  a 
way  consistent  with  the  special  theory  of  relativity  was  proposed 
by  Klein  and  Gordon  in  1926.  (This  method  was  also  put  forward 
by  Schrodinger  and  by  Fok.)  The  simplest  way  of  obtaining  the 
Klein-Gordon  equation  consists  in  taking  the  relativistic  relation- 
ship between  the  energy  £,  momentum  p,  and  rest  mass  m0  of  a 
free  particle 

£*  —  <?V  —  mjc4=0,  (15.3) 
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substituting  the  energy  and  momentum  operators 

E  =  tt-^.,   p  =  —  iftV,  (15.4) 

which   act  on  the  wave  function   <|>(r,  t).    Replacing  m0  by  Uk^c  and 
dividing  by  h2c'2,  we  obtain1 


=   .  (15.5) 

Here 

*,:=:*,    x*~y,    x<  =  z,    xi  =  ict  (15.6) 

(a  double  occurrence  of  the  subscript  p,  in  a  term  indicates  that  it 
should  be  summed  from  1  to  4).  Since  our  initial  equation  was  the 
relativistic  relationship  (15.3),  Eq.  (15.6)  is  relativistically  in- 
variant and,  therefore,  it  is  symmetric  with  respect  to  the  space 
and  time  coordinates.  We  shall  not  attempt  to  prove  the  invariance 
of  the  Klein-Gordon  Equation  more  rigorously,  and  shall  now  pro- 
ceed to  examine  its  properties.  ^ 


B*yTHE  CHARGE  AND  CURRENT  DENSITY 

As  in  the  nonrelativistic  theory,  equations  for  the  charge  and 
current  density  can  be  obtained  on  the  basis  of  the  equation  of 
continuity 


We  multiply  (15.5)  on  the  left  by  •]»*,  and  the  complex  conjugate 
of  this  equation  [that  is,  Eq.  (15.5)  with  <1>*  substituted  for  <!>  ]  by 
t  .  Then  we  subtract  the  second  of  the  resulting  equations  from 
the  first 


(15.8) 
After  some  simple  transformations,  Eq.  (15.8)  becomes 

=0.  (15.9) 


Here  and  in  the  subsequent  chapters,  we  shall  not  write  ^  as  a  function  of  t,  as  was 
done  m  the  Schodmger  theory.  In  the  wave  equation  for  monochromatic  (K  ~  const)  waves, 
for  which  only  the  time-independent  part  of  the  0  function  must  be  considered,  we  shall 
use  the  energy  eigenvalue  instead  of  the  operator. 
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Defining  the  charge  density  and  the  current  density  as 


note  that  these  expressions  satisfy  the  equation  of  continuity 
•7).  Moreover,  they  define  a  four-dimensional  vector 


ere 

Xi  =  ict,    ]\—icp.  (15.13) 

e  current  density  (15.11)  is  identical  with  the  nonrelativistic 
jression  (5.21),  and  the  charge  density  (15.10)  reduces  to  the 

irelativistic  expression  (5.20)  when  i/*<c.  Substituting  ih  d£  —  E 
e  (15.4)]  into  (15.10),  we  obtain 

p  =  ^V^<]),  (15.14) 

r  7M0CJ    T     T  * 

ich  becomes  the  usual  expression  p  =  e^*ij>  in  the  nonrelativistic 
)roximation  E~m($*m  Thus  we  have  selected  a  normalization  in 
ich  the  relativistic  values  of  p  and  /  reduce  to  the  corresponding 

irelativistic  expressions  when  (  ^-J  <;  1  . 

It   is    worth   noting   that   the  definition  of  the  particle  density 
distinguished  from  the  charge  density) 


es    rise   to    some   difficulties    in   the    relativistic    theory.  The 
sin-Gordon  wave  equation  is  a  second-order  differential  equation 

1,    therefore,  both  <|>  and  d^    can  be  arbitrarily  defined  at  some 

dt 

-en  time  t.  Consequently,  the  density  pn  (15.15)  is  not  positively 
*ined,  unlike  the  nonrelativistic  probability  density 

p.  =  **t-  (15.16) 
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Accordingly,  the  expression  (15.15)  cannot  be  interpreted  as  the 
particle  density  (that  is,  the  "number  of  particles  per  unit 
volume").  The  underlying  reason  for  this  is  that  the  same  rela- 
tivistic  equation  describes  particles  with  either  positive  or  nega- 
tive charge  (and,  indeed,  *  mesons,  to  which  the  Klein-Gordon 
equation  is  applicable,  may  be  either  positive  or  negative  in  charge). 
The  quantity  p0  ,  therefore,  can  have  both  signs. 

C.    RELATIVISTIC  THEORY  OF  THE  HYDROGEN  ATOM 
(NEGLECTING  THE  ELECTRON  SPIN) 

To  treat  the  interaction  of  a  particle  with  an  electromagnetic 
field  (defined  as  usual  by  a  vector  potential  A  and  a  scalar  po- 
tential <t>),  we  introduce  the  same  operators  as  in  the  nonrela- 
tivistic  case 

(15.17) 
From  (15.3),  we  can  obtain  the  Klein-Gordon  equation2 


We  shall  use  this  equation  to  study  the  spectrum  of  the  hydrogen- 
like  atom.  Setting  4=0.  and  V=e®  =  —  ~*  in  (15.19),  we  have 

^  +  ^v  {(E-lO'-mJcn^O.  (15.20) 

Since  the  potential  energy  does  not  depend  on  time  in  this  equation, 
we  can  transform  to  the  time-independent  case  by  means  of  the 
substitution 


In  this  equation,  we  have  not  included  the  rest  mass  energy  m^  of 
the  particle  in  the  energy  E.  As  a  result,  Eq.  (15.20)  becomes 

=  °-  (15.21) 


This  equation  can  also  be  obtained  from  the  relativistic  Hamiltonian  for  a  particle  in 
an  electromagnetic  field 


"'O^     *  €®  •  (15  18) 

It  is  only  necessary  to  transfer  <?$  to  the  left-hand  side,  square  both  sides,  and  replace 
p  and  A  by  their  quantum-mechanical  operators. 
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Just  as  in  the  Schrodinger  theory,  we  shall  look  for  a  solution  to 
this  equation  in  the  form 

»,?).  (15.22) 


The  equation  for  the  radial  part  is 

=o.  (15.23) 


Here   a  =  -t|-^y~    is   a   dimensionless   quantity,   called  the  fine 
structure  constant,  and 


When  c*--  -oo,  the  expressions  (15.24)  reduce  exactly  to  their  non- 
relativistic  counterparts  (see  Chapter  13). 

The  somewhat  improved  values  that  we  have  obtained  for  A 
and  B  by  taking  into  account  the  relativistic  effect  do  not  change 
in  any  way  the  general  character  of  the  solution  that  was  obtained 

Z*oc3 

in  nonrelativistic   theory.    Formally,  the  additional  term    -^-    in 

Eq*  (15.23)  can  be  treated  as  a  relativistic,  attractive  potential 
energy,  which  obeys  an  inverse  square  law  and  which  affects  the 
solution  under  certain  conditions.  A  detailed  analysis  of  the  role 
of  this  term  will  be  given  below. 

First  of  all,  let  us  consider  the  asymptotic  solution  /?0  as 
/•-+0.  In  this  case,  Eq.  (15.23)  reduces  to 


p  _Q 
7  --  ~dr*  ---  7*  -     °~~ 

We  shall  look  for  a  solution  of  this  equation  in  the  form 


We  then  obtain  an  equation  for  s 


s(s+  !)  —  /(/+  1)  +  ZV  =  0,  (15.26) 

the  solution  of  which  is 


(15.27) 


.  ,      -  -~ 

Consequently, 

(15.28) 
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If 


both  roots  s,  and  s,  will  be  real  for  all  values  of/  (I  =  0,  1,  2,  .  .  .). 
We  retain  only  the  solution  for  r/?0  that  does  not  diverge  at  r  =  0; 
that  is,  we  set  Ca  =  0.  Similarly,  only  the  exponentially  decreasing 
solution  for  the  wave  function  as  r  —  oo  should  be  kept  when 
£<0  (A  >0).  The  asymptotically  decreasing  solutions  for  the  two 
limiting  values  of  r  yield  the  same  equation  for  the  energy  spectrum 
as  in  the  Schrbdinger  theory,  as  can  be  seen  from  Eq.  (13.20)  by 
substituting  Sj  for  /.  Thus,  for  the  eigenvalues  we  have  the  equation 


£V.  (15.29) 

Substituting  the  relativistic  values  (15.24)  of  the  constants  A  and 
B,  we  have  (for  n  =  k  +  I  +  1 ) 


(15.30) 


Expanding  this  expression  into  a  series  of  powers  of  Z2a2  and 
retaining  only  the  first  two  nonvanishing  terms,  we  obtain  an 
energy  spectrum  which  includes  relativistic  effects: 


n  3  \1 

1  --  T]   ' 

T-        /J 


(15.31) 


The  first  term  is  identical  to  the  nonrelativistic  expression.  The 
second  term,  which  is  proportional  to  the  square  of  the  fine 
structure  constant  a  ^- 1/137,  gives  the  relativistic  correction. 

The  relativistic  correction  for  the  hydrogen  atom  (Z  =  1 )  is 
interesting  because  it  removes  the  degeneracy  with  regard  to  /. 
The  level  for  a  given  n  is  split  into  n  closely  spaced  sublevels 
(the  close  spacing  is  a  consequence  ofthesmallness  of  a2)  because 
the  orbital  quantum  number  /  can  assume  n  different  values 
(/  =  0,  1,  2,  .  .  .  ,  n—  1 ).  In  order  to  compare  these  results  with 
experiment,  let  us  compute  the  distance  between  the  doublet  states 
of  the  Balmer  series  (rc  =  2).  We  find 

A,rt £21  —  £20 —  *.&*.  /l^  *^ 

A  n       ~~~  3  16  •  iio.twj 

Experimental  data  show  that  the  actual  distance  between  doublet 
states  in  the  Balmer  series  is  only  one  third  of  the  distance 
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given  by  Eq.  (15.32).  This  discrepancy  arises  because  the  fine 
structure  of  the  hydrogen  levels  cannot  be  explained  entirely  in 
terms  of  the  relativistic  relationship  between  mass  and  velocity. 
As  we  shall  see  in  Chapter  19,  it  is  also  necessary  to  consider  the 
electron  spin  (that  is,  the  intrinsic  angular  momentum  of  the 
electron,  which  gives  rise  to  a  magnetic  moment).  At  first  it 
was  thought  that  the  Klein-Gordon  equation  could  be  used  to  describe 
a  relativistic  electron.  As  a  result  of  the  discrepancies  between 
its  predictions  and  experiment,  however,  it  was  established  that 
it  describes  particles  with  spin  of  0,  whereas  the  electron  has  a 
spin  of  1/2.  Consequently,  the  Klein-Gordon  equation  can  be  used 
for  TU  mesons,  which  have  a  spin  of  0. 

Finally,  let  us  consider  the  case  in  which 

Za>-~  (15.33) 

in  Eq.  (15.27).  In  this  case,  the  solution  does  not  consist  of  a 
correction  added  to  the  nonrelativistic  solution,  but  is  fundamentally 
new.  Indeed,  for  1  =  0  both  roots  s{  and  s4  are  complex.  Therefore, 
the  asymptotic  solution  (15.28)  is 

flo  =  y=f  (C,«'T'  +  Ctf-  ln  (15.34) 

where  7=  I/  2V2  —  --.    We  cannot  impose  the  condition  C2  =  0  (or 

Ci  =  0)  on  our  problem  because  both  solutions  have  the  same 
singularity  as  r-+Q.  Since  the  solution  remains  unrestricted  by  a 
potential  barrier  at  small  r,  even  when  E<0§  the  energy  spectrum 
for  J  =  0  is  continuous.  Consequently,  the  particle  will  "fall"  to 
the  center. 

The  question  of  the  stability  of  the  motion  of  the  particle  is  very 
important  in  studies  of  the  central  forces.  The  above  results  can  be 
used  to  analyze  the  solution  of  the  Schrodinger  equation  in  the 
general  case  of  an  attractive  potential 


On  obtaining  the  asymptotic  solution  for  r->Q  [15.25],  we  see  that 
the  solution  will  vanish  at  the  origin  only  for  a  maximum  value  of 
q  equal  to  2  and  that  the  particle  will  not  fall  to  the  center  if 


It  is  interesting  to  note  in  this  connect  ion  that  the  r3  dependence 
of    potential    energy    occurs   fairly   frequently   in  the   theory  of 
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elementary  particles  since  the  potential  energy  of  this  form  charac- 
terizes the  inter  action  of  two  elementary  magnet  icdipoles.  Actually, 
two  cases  have  to  be  distinguished.  In  the  first  case,  V  ^r*  only  at 
relatively  large  distances,  while  at  small  distances  V  varies  as 
r"a.  This  behavior  of  V  is  observed  for  the  spin-orbit  interaction 
in  the  Dirac  theory  and  it  does  not  give  rise  to  any  difficulties. 
Moreover,  it  is  found  that  stable  motion  corresponds  to  a  value  of 
Z  greater  than  any  in  the  present  periodic  system  of  elements 
(Z  <  137)  because  the  spin  effects  reduce  the  influence  of  the 
relativistic  effects  (whereas  in  the  Klein-Gordon  theory  Z  is  con- 
fined to  relatively  small  values  Z  <  y  •  137). 

In  the  second  case,  V  continues  to  vary  as  r~3  even  at  small 
distances  (r->0),  and  particles  cannot  be  combined  into  atomlike 
systems.  This  case  can  be  observed  in  the  meson  theory  of 
nuclear  forces,  where  quasi-magnetic  interactions  are  of  con- 
siderable importance.  The  formation  of  an  atomlike  system  be- 
comes possible  only  if  the  potential  is  cut  off  at  small  distances 
from  the  origin. 


Problem  15.1.1  Find  an  expression  for  p  and  j  if  the  scalar  relativistic  equation  con- 
""       TBlllt'lffllug  from  the  presence  of  an  electromagnetic  field. 

Solution.   Let  us  substitute  p  — •  p  —  y  A,  E  -*  E  —  e<b  ln  the  Klein-Gordon  equation, 
which  now  describes  the  motion  of  a  particle  in  an  electromagnetic  field. 


Repeating  the  calculations  that  lead  to  Eq.  (15.15),  we  obtain  the  generalization 


where  the  four-dimensional  potential  A^  has  the  components 


i^Problem  15.2A  Show  that  in  the  case  of  time-independent  potentials  A  and  $,  the 
space*1  n  liiiLiiini1  'TfCordinates  in  the  Klein-Gordon  equation  can  be  separated  and  the  wave 
function  can  be  written  as 


Find  the  wave  function  of  a  free  particle  described  by  the  Klein- 
Gordon  equation  using  for  normalization  the  expression  for  the  density  p .  Show  that  in 
the  relativistic  case,  p  is  the  charge  density  rather  than  the  particle  density. 

Solution.  Suppose  the  momentum  of  the  particle  is  directed  along  the  z  axis  and 
that  the  particle  travels  in  a  segment  of  length  L  (the  one-dimensional  case).  The 
solution  of  the  Klein-Gordon  equation  can  be  written  in  the  form 


--= 
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where  E  =  ±.  cMC  is  the  energy  of  the  particle.  Since  the  charge  density  is  given  by  the 
expression 

teH 

f= 

the  total  charge  is  [see  (15.14)] 


-1/2 

It  follows  that  p  is  the  charge  density  rather  than  the  particle  density,  since  this  re- 
lation can  be  interpreted  only  if  it  Is  assumed  that  particles  described  by  an  amplitude 
B  (negative  energies)  have  a  charge  of  opposite  sign  from  particles  described  by  an 
amplitude  A  (E  >0). 


Chapter  16 

Motion  of  an  Electron  in  a  Magnetic  Field. 
Electron  Spin 


In  1896,  Zeeman  found  that  when  atoms  are  placed  in  a  strong 
magnetic  field,  their  spectral  lines  are  split  into  several  compo- 
nents. This  phenomenon  is  known  as  the  Zeeman  effect.  The 
Zeeman  effect  played  an  important  role  in  the  investigation  of  the 
structure  and  magnetic  properties  of  the  atom.  It  led  in  particular 
to  the  discovery  of  the  spin  (intrinsic  mechanical  moment)  and 
magnetic  moment  of  the  electron.  Accordingly,  it  is  worth  elaborat- 
ing the  theory  of  this  effect  in  some  detail. 


A.  THE  CLASSICAL  THEORY  OF  THE  ZEEMAN  EFFECT 

The  simplest  model  of  the  radiating  atom  in  Lorentz*  selection 
theory  is  based  on  the  assumption  that  the  electron  moves  under 
the  influence  of  an  elastic  force 

F=-kr.  (16.1) 

The  elastic  constant  ft  is  related  to  the  electron  mass  and  the 
angular  frequency  of  oscillation  o>0  by  the  expression 

ft=:w0o)J.  (16.2) 

The  equation  for  the  oscillations  of  an  electron  in  a  homogeneous, 
constant  magnetic  field  //,  therefore,  becomes 

m0/M-/W==-~7  fxH,  (16.3) 

where  e  =  —  e0  is  the  electron  charge.  Taking  the  components  of 
(16.3)  along  the  coordinate  axes  (the  z  axis  is  chosen  in  the  direc- 
tion of  the  field  //,  so  that  Hx  =  Hy  =  0,  H,  =  <JtT),  we  find 


(16.4) 
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Multiplying   the    second   equation  by  i  (i=  K-T)   and  adding  it  to  the 
first,  we  obtain 

(16.5) 

where  0  —  ^       is  the  Larmor  frequency  of  precession  and  £  =  *-{-  iy. 
jLtn^c 

For  o<u>0  the  solution  of  (16.5)  is  of  the  form1 


(16.6) 
and  it  follows  from  (16.4)  that  the  z  component  is 

z=:Ce±«M.  (16.7) 

From  the  above  expressions  it  can  be  seen  that  the  frequency  of 
oscillation  of  the  electron  (a  three-dimensional  oscillator)  changes 
under  the  influence  of  a  magnetic  field.  An  atom  placed  in  a 
magnetic  field  should  emit  radiation  at  three  frequencies: 

0)0  -  0,    OJ0l     (t)0-j~0.  (16.8) 

According  to  the  classical  theory,  an  oscillator  does  not  emit 
energy  in  the  direction  of  oscillation.  Therefore,  when  we  observe 
the  light  emitted  by  an  atom  in  the  ?  direction  (the  direction  of  the 
magnetic  lines  of  force),  we  are  able  to  detect  only  two  lines  (there 
will  be  no  component  <»>0  due  to  oscillations  along  the  z  axis).  In 
other  directions,  we  are  able  to  observe  all  three  components  (the 
normal  Zeeman  effect).  Equations  (16.6)  and  (16.7)  indicate  that 
the  oscillations  are  resolved  in  a  longitudinal  component  in  the 
direction  of  the  magnetic  field  (the  z  axis)  and  two  transverse 
components  corresponding  to  two  directions  of  rotation  (a  right- 
handed  rotation  and  a  left-handed  rotation).  Thus,  the  magnetic 
field  has  no  effect  on  the  longitudinal  oscillations  and  acts  only 
on  the  circular  rotations  in  the  plane  perpendicular  to  the  magnetic 
field. 

In  quantum  theory  a  change  in  the  frequency  of  oscillation  is 
always  associated  with  a  change  of  energy.  At  first  glance,  it  may 
seem  strange  that  the  magnetic  field  changes  the  energy  of  the 

electron,  since  the  Lorentz  force  F=  —   vxHis  perpendicular  to 

the  velocity,  and  therefore  the  work  done  by  this  force,  just  as  the 
work  done  by  any  centripetal  force,  must  be  equal  to  zero.  On  the 


Even  in  the  case  of  very  strong  fields  (&%*  '^  10  gauss)  the  quantity  o  is  of  the 
order  of  10  sec  ,  whereas  the  frequency  of  oscillation  of  an  electron  in  an  atom  (the 
optical  spectrum)  is  w0  ~  10  6  sec"  Therefore,  the  inequality  o  «  u>Q  is  practically 
always  satisfied 
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other  hand,  it  is  well  known  that  an  electron  rotating  in  a  circle 
(that  is,  a  current  loop)  forms  a  dipole.  The  energy  of  this  dipole 
in  a  magnetic  field  is  equal  to 

ym*g  —  —  '£9/f.  (16.9) 

These  two  conflicting  conclusions  may  be  explained  in  the  following 
manner.  As  the  magnetic  field  changes  from  zero  to  a  certain 
constant  value  Hg  =  <W9  the  electron  experiences  a  force  directed 
along  one  of  the  components  of  the  electric  field  g.  This  force 
imparts  an  additional  energy  to  the  electron.  The  magnitude  of 
this  component  of  the  electric  field  can  be  found  from  Faraday's 
law  of  induction  (second  Maxwell  equation): 


Assuming  that  ^T  and  g  depend  only  on  time  and  that  the  switching 
on  of  the  magnetic  field  does  not  alter  the  radius  of  the  stationary 
orbit,  we  find 


2c~dt 


The  additional  velocity  imparted  to  the  electron  (e=  —  e0)  by  this 
electric  field  can  be  found  from  the  equation 


dt  m0        2w0c    dt    * 

which  gives 

0mag_  J?JL  o^T 

2/w0f 

As  we  can  see,  fmae  is  independent  of  the  rate  of  change  of  the 
magnetic  field  when  it  is  switched  on.  Since  the  magnetic  field  is 
directed  along  the  z  axis  and  the  induced  electric  field  (and  there- 
fore also  umas)  is  perpendicular  to  it  and  to  the  radius  of  the  orbit, 
we  may  write  in  vector  form 


From  this  it  is  clear  that  the  magnetic  field  produces  an  additional 
rotation    of   the   electron   (Larmor   precession)   with   an   angular 

velocity  0=3^. 

We  may  now  determine  the  unknown  additional  energy  acquired 
by  a  rotating  electron  when  the  magnetic  field  is  switched  on.  Since 
the  energy  of  an  electron  placed  in  a  magnetic  field  is  determined 
entirely  by  its  kinetic  energy,  we  have 
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where  v  is  the  velocity  of  the  electron  before  the  magnetic  field  is 
switched  on.  Retaining  only  terms  that  are  linear  with  respect 
to  v  mas,  we  obtain 

Vmag=^^Hxr)=:^ff^rXv)f  (16.10) 

Comparing  (16.10)  with  (16.9),  we  see  that  the  magnetic  moment 
of  an  electron  moving  in  a  circle  is  given  by  the  expression 

p  =  —  g  r*v.  (16.11) 

Recalling  that  its  angular  momentum  is  equal  to 

£,  =  m0  r  x  #, 
we  find  a  simple  relation  between  these  two  quantities 

>=-&*•  (16-12) 

It  is  worth  noting  that  the  magnitude  of  the  magnetic  moment  can 
also  be  found  from  other  considerations.  As  we  know,  the  magnetic 
moment  of  a  current  loop  is 

»-?*• 

where  n*  is  a  unit  vector  normal  to  the  plane  of  the  current  loop. 
In  the  above  relationship  the  current  strength  is  equal  to 


where    T  =  ^r-  is   the  period  of  rotation,  while  the  area  enclosed 
by  the  current  is 


Combining  these  last  relations  we  again  obtain  the  expression  (16.11) 
for  the  quantity  ^  . 

B.    THE   ZEEMAN  EFFECT  IN  THE   NONRELATIVISTIC 
SCHRODINGER  THEORY 

In  order  to  obtain  the  Schrodinger  equation  for  an  electron 
moving  in  a  magnetic  field,  we  shall  use  the  general  rule  for  trans- 
formation of  the  classical  Hamiltonian  to  the  quantum  case  (see,  for 
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example,  Chapters).  To  do  this,  we  substitute  the  momentum  oper- 
ator p  into  the  classical  expression  for  the  energy  of  an  electron  in 
the  presence  of  electrostatic  and  magnetic  fields.  The  Schrodinger 
equation  for  the  central  forces  in  the  presence  of  a  magnetic  field 
characterized  by  the  vector  potential  A  then  takes  the  form 

(£~HS)^  =  0  (16.13) 

where  the  Hamiltonian  of  the  Schrodinger  equation  is 

-  (16.14) 

niQ 

and  the  operator  P  —  p—  e  A   is  called  the  generalized  momentum 

operator.  For  the  case  of  a  constant,  homogeneous  magnetic  field 
directed  along  the  z  axis  (Hx  =  Hy  =  Qt  HZ  =  <£F)9  we  may  write 


Using  the  fact  that 


where 


and  neglecting  the  terms  proportional  to  the  square  of  the  magnetic 
field  strength  *#',  we  find 


=  0.  (16.15) 

Remembering  that 


where 


we  reduce  the  Schrodinger  equation  to  the  form 
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It   is    easy  to  show  that  this  equation  is  satisfied  by  the  usual 
wave  function  for  a  centrally  symmetric  field: 

t==/?(r)r«(ftt  cp).  (16.18) 

Substituting  this  solution  into  (16.17)  and  recalling  that 


we  obtain  the  equation 


which   also  includes  the  effect  of  the  magnetic  field  on  the  atom. 
This  equation  may  be  written  as 


C16.19) 

where  we  take  the  charge  of  the  electron  as 


The  last  term  in  the  Hamiltonian  may  be  attributed  to  the  presence 
of  the  orbital  magnetic  moment  of  the  atom,  which  gives  rise  to  an 
additional  energy 

j!{;  (16.20) 

Therefore,  the  orbital  magnetic  moment  obtained  on  the  basis  of 
the  Schrodinger  theory  is 


Recalling  the  expression  for  the  z  component  of  the  angular  momen- 
tum 

Lg  =  hm, 

we  obtain  the  same  relation  between  the  magnetic  moment  and  the 
angular  momentum  as  in  the  classical  theory  [see  (16.12)]: 

=-•••  <16-22> 


It  follows,  therefore,  that  the  components  of  the  orbital  magnetic 
moment  are  multiples  of  a  certain  unit  magnetic  moment 
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^=5fi     =  9-273'  10~21  erg.gauss-1,  (16.23) 

which  is  called  the  Bohr  magneton. 

The  orbital  magnetic  moment  of  an  electron  is  one  of  the  most 
important  magnetic  properties  of  an  atom.  It  can  be  seen  from 
Eq.  (16.20)  that  the  additional  energy  of  an  orbital  electron  placed 
in  a  magnetic  field  is  given  by  the  expression 

(16.24) 


since  Vmas  is  a  constant,  where  o  is  the  Larmor  frequency.  Because 
of  the  selection  rules  for  the  magnetic  quantum  number  (A/n  =  0,  :+-  1), 
the  additional  radiation  frequencies  due  to  the  Zeeman  splitting 
are  the  same  as  in  the  classical  theory  [see  (16.8)],  namely, 

o  A/ii  =  0,  +o.  (16.25) 


The  ilormal  Zeeman  splitting2  of  the  spectral  lines  (triplets 
and  doublets)  is  encountered  only  in  the  case  of  a  strong  field  (the 
Paschen-Back  effect)  or  in  the  case  when  the  total  spin  of  the 
electrons  in  the  atom  is  equal  to  zero  (for  example,  in  parahelium, 
whose  outer  shell  consists  of  two  electrons  with  oppositely  directed 
spins).  In  cases  in  which  the  spectral  lines  are  split  into  more  than 
three  components,  the  Zeeman  effect  is  said  to  be  anomalous. 
The  anomalous  Zeeman  effect  is  connected  with  the  spin  properties 
of  electrons,  and  an  explanation  of  this  effect  can  be  constructed 
only  on  the  basis  of  Dirac's  theory,  which  takes  into  account  the 
spin  effects  (see  Chapter  20). 


C.   THE  EXPERIMENTAL  DISCOVERY  OF  ELECTRON  SPIN 

It  was  shown  in  the  last  section  that  ^theJfohrQdingar  theory  is  able 
^  e^plaii\_only  the  orbital  angular  momentum  and  magnetic  mofffent 
The  basic  equations  character!  zing  these  prop'er  ties 


are  Eq.   (16.22)  for  the  ratio  of  the  orbital  magnetic  moment  and 


Let  us  note  that  the  reasons  for  the  use  of  the  terms  "normal  Zeeman  effect"  and 
"anomalous  Zeeman  effect"  are  purely  historical  Before  the  discovery  of  electron  spin, 
only  the  classical  theory  of  triplet  splitting  (normal  Zeeman  effect)  was  known  When  a 
more  complicated  splitting  was  discovered  it  was  called  the  anomalous  Zeeman  effect 
because  no  theoretical  explanation  could  be  given  for  it  until  the  development  of  the  theory 
of  electron  spin 
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the  orbital  angular  momentum  and  Eq.  (16.24),  which  indicates  that 
the  number  of  possible  orientations  of  the  magnetic  moment  relative 
to  the  z   axis  is  necessarily  odd,  since  the 
number    of    states    with   different  quantum 
numbers  m  is  equal  to  2/  +  l.TheSchrodinger 
theory,  however,  does  not  adequately  account 
for  all  the  experimental  data,  the  analysis  of 
which  led  to  the  discovery  of  the  spin  prop- 
erties  of   electron.    Let  us  briefly  discuss 
these  experimental  results. 

1.  First  of  all  let  us  consider  the  Einstein- 
de  Haas  experiment  (1915),  which  was  carried 
out  in  order  to  verify  Eq.  (16.22): 


Fig  16  1.  Diagram  of  the 
Einstein-do  Haas  experi- 
ment for  the  determination 
of  the  Lande  g  factor. 
1)  quartz  fiber,  2)  cur- 
rent carrying  coil,  3)  fer- 
ro-magnetic  rod. 


where  g ,  the  Lande  factor,  should  be  equal  to 
unity  for  orbital  moments.  In  this  experi- 
ment, a  ferromagnetic  rod  is  suspended  on 
a  quartz  fiber  and  magnetized  by  passing  a 
current  through  a  coil  (see  Fig.  16.1).  As 
a  result,  the  rod  acquires  a  magnetic  moment 
and  an  angular  momentum  whose  magnitude 
can  be  determined  from  the  angular  rotation 
of  the  quartz  fiber.  If  an  alternating  current  is  passed  through  the 
coil,  an  alternating  torque  will  arise,  causing  torsional  vibrations 
in  the  ferromagnetic  sample.  In  addition,  resonance  can  be  used 
to  enhance  the  rotational  effect.  Experimental  measurements  of 
the  gyromagnetic  ratio  (16.22)  show  that  the  sign  of  this  ratio  is 
negative,  so  that  it  can  be  definitely  concluded  that  the  magnetiza- 
tion of  the  ferromagnetic  sample  is  due  to  the  motion  of  electrons. 
The  value  of  the  Lande  g  factor,  however,  turned  out  to  be  equal 
to  two  (£  =  2),  rather  than  to  the  unity  that  was  required  by  the 
classical  or  Schrodinger  theories.  This  g  value  was  not  explained 
until  the  development  of  the  theory  of  electron  spin  (see  below). 


II 


n 


<?mag  H(z) 


Fig.  16.2.  Diagram  of  the  Stern-Gerlach  experiment  for  the 
determination  of  the  magnetic  moment  of  monovalent  atoms 
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2.  Stern  and  Gerlach  (1921)  studied  the  behavior  of  a  beam  of 
atoms  in  an  inhomogeneous  magnetic  field  in  order  to  check  the 
theoretical  result  (16.23) 


which  describes  the  spatial  quantization.  In  their  classical  experi- 
ments a  beam  of  monovalent  atoms  (hydrogen,  lithium,  silver), 
traveling  along  the  x  axis,  crossed  a  magnetic  field  directed  along 
the  2  axis  (Hx  =  Hy  =  Qt  Hz  =  e?f).  This  magnetic  field  was  very 
inhomogeneous,  so  that  it  had  large  gradients.  Then  a  magnetic 
dipole  of  moment 

J*  =  *mag'.  (16.26) 

where  emag  is  the  "magnetic  charge  "  and  /  is  the  length  of  the 
dipole,  will  experience  a  force  directed  along  the  z  axis3 


-  ^  (*  -  1  COS  *)}  = 
=  p,^  =  -tod-!Jfm.  (16.27) 

Let  us  calculate  in  a  simplified  fashion  the  displacement  ex- 
perienced by  a  particle  under  the  action  of  the  force  Fz  during  the 
time  t.  If  the  particle  moves  with  a  velocity  v  perpendicular  to 
the  magnetic  field  (that  is,  to  the  z  axis)  and  travels  a  distance 
L  =  vt9  the  displacement  along  the  direction  of  the  z  axis  will 
equal 

(16'28) 


In  this  case  the  acceleration  is  w=  ^  ,  where  the  force  Ft  is  given 

by  (16.27),  and  M  is  the  mass  of  the  atom.  Consequently,  a  beam  of 
particles  possessing  a  magnetic  moment  |u,  will  be  split  into 
components  as  it  passes  through  an  inhomogeneous  magnetic  field. 
The  number  of  components  is  determined  by  the  possible  number 
of  projections  of  the  magnetic  moment  ^  on  the  direction  of  the 
field. 

In  their  experiments,  Stern  and  Gerlach  studied  the  splitting 
of  a  beam  of  atoms  in  the  s  state.  In  this  state,  the  angular 
momentum  and  consequently  the  magnetic  moment  of  an  atom  are 
equal  to  zero  (/  =  m  =  0),  and  therefore  there  shouldbe  no  splitting. 
If  the  electron  is  in  the  p  state  (£=  1),  then  triple  splitting  should 


We  note  that  in  order  to  determine  the  motion  of  the  center  of  mass  of  the  magnetic 
dipole,  it  is  quite  immaterial  whether  we  regard  it  as  a  rigid  dipole  or  a  current  loop 


MOTION    OF    AN    ELECTRON     IN    A    MAGNETIC    FIELD  277 

be  observed  because  of  the  three  possible  values  of  the  magnetic 
quantum  number 


Experiments  on  hydrogen,  lithium,  silver,  and  other  atoms  show, 
however,  that  the  beam  is  split  into  only  two  components.  This 
proved  the  existence  of  a  magnetic  moment  for  atoms  in  the 
s  state.  The  projection  of  this  magnetic  moment  on  the  z  axis  can 
assume  only  two  values.  The  measurements  of  the  quantity  p. 
showed  that  it  is  equal  to  one  Bohr  magneton 

<16-29) 

In  order  to  reconcile  the  results  of  these  two  classical  experi- 
ments, Uhlenbeck  and  Goudsmit  introduced  the  hypothesis  that 
an  electron  posses  an  intrinsic  angular  momentum  in  addition 
to  its  orbital  angular  momentum.  At  first  it  was  believed  that 
this  intrinsic  angular  momentum  could  be  treated  by  analogy 
with  a  top  spinning  about  an  axis,  and  therefore  it  was  called  the 
electron  spin.  It  must,  however,  be  emphasized  that  no  rigorous 
classical  theory  of  spin  exists.  According  to  the  hypothesis  of 
Uhlenbeck  and  Goudsmit,  the  intrinsic  angular  momentum  of  an 
electron  is  equal  to 

sz  =  ±~h;  (16.30) 

that  is,  the  quantum  number  characterizing  its  projection  on  the 
z  axis  takes  on  half-  integral  values  ms  =  ±\  .  The  important 


distinction  between  the  integral  (orbital  magnetic  quantum  number 
m)  and  the  half-  integral  (spin  quantum  number  /;/,  )  values  of 
quantum  numbers  lies  in  the  number  of  possible  states.  Integral 
quantum  numbers  always  give  us  an  odd  number  of  states  (for 
/  =  0  we  have  one  state  m  =  Q\  for  l  =  \  there  are  three  states 
m=0,  -|-lf  —  1;  and  so  on).  On  the  other  hand,  half-  integral 
quantum  numbers  give  us  an  even  number  of  states  (for  example, 

1  113 

for  s  =  -£•    there   are   two    states    ms  =  -\-^t  ~~7  *»  f°r  5==lf  ^ere 

are  four  states;  and  so  on). 

The  assumption  of  the  half-integral  quantum  numbers  was 
introduced  even  before  Uhlenbeck  and  Goudsmit  in  order  to  explain 
the  double  splitting  of  terms  for  the  monovalent  atoms.  The  Stern- 
Gerlach  experiment  showed  that  there  are  two  possible  electron 
states  in  a  monovalent  atom;  that  is,  the  electron  spin  must  be 
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described  by  the  half- integral  quantum  numbers  corresponding  to 
two  opposite  orientations.  Recalling  that  the  Einstein-de  Haas 
experiment  showed  that  the  Lande  g  factor  in  Eq.  (16.29)  is  equal 
to  two  (g~2)  and  the  intrinsic  angular  momentum  is  given  by 
Eq.  (16.30),  we  find  the  following  expression  for  the  z  component 
of  an  intrinsic  magnetic  moment: 

^  =  -£c*.  =  =FI*o.  (16.31) 

which  is  simply  one  Bohr  magneton.  The  introduction  of  the 
electron  spin  also  made  it  possible  to  explain  the  multiple  splitting 
of  the  spectral  lines  of  atoms,  as  well  as  their  magnetic  properties. 


D.    PAULI   EQUATION 

A  nonrelativistic  wave  equation  that  includes  the  intrinsic 
magnetic  moment  of  the  electron  was  first  proposed  by  Pauli.  For 
this  purpose  he  took  the  ordinary  Hamiltonian  of  the  Schrodinger 
equation  and  added  to  it  a  term  represent  ing  the  interaction  between 
the  magnetic  moment  of  the  electron  M-  and  the  external  magnetic 
field  H  : 

l/p  —  —  \L*H.  (16.32) 

Then  the  time-  independent  Schrodinger  equation  takes  the  form 

{£  —  HS   -\p.  H}ty  =  Q,  (16.33) 

where  the  Hamiltonian  IIs   is 


HS    =2 

Next,  it  was  necessary  to  find  suitable  quantities  to  describe 
the  intrinsic  magnetic  moment  of  the  electron.  It  is  well  known 
that  introduction  of  the  spin  is  related  to  introduction  of  a 
fourth  quantum  number,  characterizing  the  internal  properties  of 
an  electron.  On  the  other  hand,  the  wave  function  ^  of  a  particle 
depends  only  on  three  quantum  numbers,  corresponding  toquantiza- 
tion  of  the  three  spatial  coordinates.  Jn_order  to  describe  spin, 
Pauli  introduced  two  wave  functions  Wj  and  U^JTifplace  of  the  single 
Jwavejunctlon  <J».  One  of  the  wave  functions  describes  a  state  'With 
one  spin  orientation  and  the  other  wave  function  describes  a  state 
with  the  opposite  spin  orientation.  The  actual  wave  equation 
represents  a  system  of  two  equations.  It  is  possible  to  represent 
a  system  of  two  or  more  equations,  such  as 
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by  a  single  equation  in  matrix  notation 

'  <16'35a> 


where  the  multiplication  is  carried  out  according  to  the  rule  for 
the  multiplication  of  matrices  (c)  =  (a)(b):  namely,  an  element  of  a 
matrix  product  is  obtained  by  multiplying  each  element  in  the 
appropriate  row  of  the  first  matrix  by  the  corresponding  elements 
of  the  appropriate  column  of  the  second  matrix  and  taking  the  sum 
of  these  products,  that  is, 

*i*  =  2fliA*-  (16.36) 

n 

Pauli  suggested  selecting  the  wave  function  *F  in  the  form  of  a  one- 

/ur  \ 
column  matrix  W  =  {  wl)  and  setting  the  intrinsic  magnetic  moment 

\^3' 

of  an  electron  equal  to 

ji,  =  —  ^0',  (16.37) 

where  |AO  is  the  Bohr  magneton  and  a'  stands  for  the  three  2x2 
Pauli  matrices 


These  matrices  are  denoted  by  the  letter  a  with  a  prime  (the  same 
letter  without  a  prime  will  be  used  to  denote  the  4  x  4  Dirac 
matrices).  These  matrices  characterize  the  components  of  the 
spin  vector  along  the  coordinate  axes. 

Using  the  rule  (16.36)  for  matrix  multiplication,  it  can  be 
readily  shown  that  the  square  of  each  Pauli  matrix  is  equal  to 
unity 

aj'^a^a.;3^!,  (16,39) 

where  I'  denotes  a  2  x  2  unit  matrix  L  j  j  .  It  can  also  be  shown 
that  different  matrices  antic  ommute  with  one  another: 


(16.40) 
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In  terms  of  the  above  matrix  expressions,  the  nonrelativistic 
Pauli  equation  has  the  form 


i  ->.]}(;:;)=«. 


This  matrix  equation  is  equivalent  to  a  system  of  two  ordinary 
equations,  each  of  which  corresponds  to  one  of  the  rows  of  the 
matrix 

(£_HS  -ptHJVt-friHs-iHJV^Q. 

(E  _.  H  S  .f  |JLo//j  V,  -  (A()  (//,  +  tHy)  \V,  =  o.  (     • 

Let  us  consider  the  case  of  an  electron  moving  in  a  magnetic  field 
directed  along  thez  axis  (HX  =  HV=(\  //,=-  JT).  Using  the  Hamil- 
tonian  (16.19),  which  includes  the  effect  of  a  magnetic  field,  we 
obtain  two  equations  of  motion  for  the  electron 


-  22  J  1ri  =  0, 
E  +  e.<S>  -  -  wX'm  +  \^'  ~     '    Uf  a  =  0, 


where  iv^'///  is  the  energy  of  interaction  between  the  magnetic 
field  G>V"and  the  orbital  magnetic  moment,  and  driv^'  is  the  energy 
of  interaction  between  the  magnetic  field  and  the  spin  magnetic 
moment.  In  the  s  state  the  magnetic  quantum  number  m  is  equal 
to  zero,  so  that  the  Pauli  equation  takes  the  form 


(16.44) 


that  is,  the  wave  function  ^  describes  a  state  in  which  the  intrinsic 
magnetic  moment  of  the  electron  is  parallel  to  the  z  axis,  and  the 
wave  function  ^3,  a  state  in  which  the  magnetic  moment  is  anti- 
parallel  to  the  z  axis.  These  are  the  orientations  of  the  intrinsic 
magnetic  moment  which  were  observed  in  the  Stern-Gerlach 
experiment. 

As  the  function  *Fh  Pauli  suggested  taking  the  Hermitian  adjoint 
of  W,  that  is,  the  matrix  \F+=(V^J)t  whose  elements  are  ob- 
tained by  taking  the  complex  conjugates  of  the  elements  of  UT 
and  transposing  them  (interchanging  rows  and  columns).  Thus,  if 
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W  is  a  column  matrix,  \Fh    will  be  a  row  matrix.  The  probability 
density  will  be  given  by 

^^-pjnqr^  (16.45) 

which  includes  the  possibility  of  two  spin  orientations. 

The  other  matrix  elements  are  formed  in  a  similar  manner. 
For  example, 


_  UT*IF       urnir  . 

—  ^<*'~~^    "  (16.46) 

that  is,  ur?^,  and  UTJ^  represent  the  probability  densities  of  states 
in  which  the  electron  has  a  spin  orientation  parallel  and  antiparallel 
to  the  z  axis,  respectively.  Using  the  expression  for  the  intrinsic 
magnetic  moment  in  the  Pauli  theory 


and  the  Einstein-de  Haas  relation  between  the  intrinsic  magnetic 
moment  and  the  angular  momentum 


v.=--e°  s, 

r  /fluf 

we  find  that 


(16.47) 


Thus,  in  agreement  with  the  other  experimental  facts,  the  z  com- 
ponent of  the  spin  angular  momentum  is  equal  to  iL  V,  . 

Since  the  spin  operator  is  expressed  in  terms  of  the  matrices 
a',  the  spin  components  do  not  commute.  In  this  they  resemble  the 
components  of  the  orbital  angular  momentum,  which  are  operators 
depending  on  derivatives  [see  (11.75)  and  (11.  76)].  The  commutation 
relations  satisfied  by  the  spin  operators  can  be  easily  established 
from  (16.40)  and  (16.47): 


(16.48) 


282  RELATIVISTIC    QUANTUM    MECHANICS 

Concluding  our  discussion,  we  note  that  the  Klein-Gordon  theory, 
which  includes  the  relativistic  effects  but  neglects  the  spin  effects, 
and  the  Pauli  theory,  which,  on  the  contrary,  neglects  the  relativistic 
effects  but  includes  the  spin  effects,  were  predecessors  of  the  more 
rigorous  theory  of  the  electron  developed  by  Dirac,  which  predicts 
all  the  elementary  properties  of  the  electron?)  It  should  be  noted 
in  this  connection  that  the  absolute  value  of  tTre  intrinsic  magnetic 
moment  was  introduced  in  the  Pauli  theory  from  purely  empirical 
considerations. 

^  P^fthlfilD, jLfiji"* Show  that  in  nonrelativistic  quantum  mechanics,  just  as  in  the  classical 
theory,  the  Zeernan  effect  is  due  to  the  precessional  motion  of  the  orbit  in  a  magnetic 
field,  the  motion  having  the  Larmor  frequency. 

Solution.  From  the  Hamiltonian  of  the  Schrodinger  equation  for  the  case  of  an  electron 
moving  in  a  magnetic  field  directed  along  the  z  axis 


we  can  find  the  time  derivatives  of  the  angular  momentum 


where  o  is  the  Larmor  frequency.  It  follows  that  the  component  of  the  angular  momentum 
in  the  field  direction  (z  axis)  is  a  constant  of  the  motion.  The  components  along  the  x 
and  y  axes,  however,  precess  around  the  z  axis  with  the  frequency  o  . 


>lem  16,2Ji  Show  that  the  spin  operator  S  is  vectorial;  that  is,  if  we  construct  the 
1  inear  comTSmatton 

S;.  =  a,Sv+  PiSj+TiS-,     S'v  =  aaSA  +  PsSj,  +  T&, 


where  a,  (J,  f  are  the  directional  cosines,  then 

S;S'V  —  SVS;  =  i»s;  and  so  on. 


roblern  16.3J  Show  that  in  a  homogeneous  magnetic  field  which  is  a  function  of  time 
only,  BUT  Wave  function  of  the  Pauli  equation  can  be  resolved  into  a  product  of  coordinates 
and  spin  functions.  What  form  does  this  solution  take  if  the  field  is  time  independent? 
Solution.  Let  us  look  for  a  solution  of  the  Pauli  equation  in  the  form 


It  is  readily  shown  that  the   coordinate  part  of  the  wave  function  ^  (r,  t)  satisfies  the 
ordinary  Schrodinger  equation  without  the  spin 


while  the  spin  part  of  the  wave  function  may  be  obtained  from  the  equation 
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The  spin  part  of  the  wave  function  is  normalized  as  follows 


In  the  case  of  a  stationary  magnetic  field  it  is  easy  to  determine  the  time-dependent 
part  in  the  above  equations.  We  simply  set 


-  '  (E-E)t 

n 


Then  the  time-independent  parts  of  the  wave  function  are  determined  from 


16.4.h  Find  the  eigenvalues  of  the  operator  of  a  spin  component  along  the 
direction  SpWlH&a  by  the  spherical  angles  d  and  <p.  Investigate  the  particular  cases  in 
which  this  direction  is  the  x9  y  or  z  axis. 

Solution.   Consider  first  the  case  in  which  the  spin  is  directed  along  the  z  axis.  Then 
the  initial  equation  takes  the  form 


where 


This  matrix  equation  is  equivalent  to  two  homogeneous  algebraic  equations 


--  C.  +  XC,  =  0. 


The  normalized  solutions  of  these  equations  have  the  form 


s  =_ft /i     o' 


The  first  evidently  corresponds   to  the  case  in  which  the  spin  is  directed  along  the  z 
axis;  the  second  to  the  case  in  which  the  spin  is  directed  along  the  -z  axis. 

The  operator  for  the  component  of  spin  along  the  direction  defined  by  the  spherical 
angles  0  and  y  with  respect  to  the  coordinate  axes  is  equal  to 

S  =  sin  ft  cos  <p  S*  +  sin  ft  sin  <?  Sy  +  cos  &  S^, 
where 
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Hence  from  the  equation 

*  (?;}=»'(£) 

we  find  two  solutions: 

(a)  the  solution  corresponding  to  the  case  in  which  the  spin  is  parallel  to  this  direction 


i         }}       ; » 
\    sin  —  el*  I 


(b)  the  solution  corresponding   to  the  case  in  which  the  spin  is  antiparallel  to  this 
direction 


, 

-  cos  —  <?«P    I 

\  ^  J 

Setting  ft  —  0,  <p  —  0  we  obtain  the  same  solution  as  above.  Hie  cases  in  which  the  spin  is 
directed  along  the  x  or  y  axes  may  be  obtained  by  setting,  respectively, 


x^  Problem  Ib-ST^The  electron  spin  is  parallel  to  the  2  axis.  Find  the  probability  that 
the  component  of  the  spin  (a)  in  a  direction  parallel  to  the  i  axis,  and  (b)  in  a  direction 

making  an  angle  D  with  the  t  axis,  will  have  the  values  -,y  h  and  —  ~  hm 

Hint.  Take  the  wave  function  describing  the  state  in  which  the  spin  is  parallel  to  the 
z  axis  and  then  expand  it  in  terms  of  the  functions  corresponding  to  the  cases  in  which 
the  spin  is  parallel  and  antiparallel  to  the  direction  forming  an  angle  &  with  the  z  axis. 
Both  these  functions  are  given  in  Problem  16.4.  Without  loss  of  generality  we  may  set 
the  angle  cp  =  0. 

'ITien  the  squared  modulus  of  the  expansion  coefficients  gives  the  probabilities 


of  the  components  of  the  spin  along  the  corresponding  directions;  these  are  equal  to 


In  order  to  find  the  x  component  of  the  spin,  we  must  set  d  ==  -    in  the  last  equations. 


Chapter  1  7 


The  Dirac  Wave  Equation1 


A.    LINEARIZATION  OF  THE  ENERGY  OPERATOR. 

DIRAC  MATRICES  AND  THEIR  RELATION  TO 

PAULI  MATRICES 

As  indicated  in  Section  15,  relativistic  quantum  mechanics  is 
based  on  the  well-known  relativistic  relation  between  the  energy  E, 
momentum  p9  and  rest  mass  m0: 

E  =  cV7*  +  //#*.  (17.1) 

To  obtain  the  wave  equation  describing  a  free  particle,  we  substitute 
the  appropriate  operators  into  this  equation 

E  =  0ljt,    p  =  -f*v.  (17.2) 

and  act  with  these  operators  on  the  wave  function.  It  is  impossible, 
however,  to  make  a  direct  transition  to  operators  in  (17.1)  because 


^  ^ 

the  radical  sign|frj£  *s  therefore  necessary  to  get  rid  of  the  square 
root"Tn^(tT.l).  Cm?  way  of  doing  this  is  to  take  the  square  of  Eq. 
(17.1).  This  gives  the  relativistic  Klein-Gordon  wave  equation  with 
a  one-component  wave  function.2  As  already  noted,  this  equation 
describes  the  motion  of  spinless  particles  and  is  not  applicable  to 
electrons,  whose  spin  is  equal  to  1/2  (in  units  of  ft). 

A  different  method  of  obtaining  a  linear  relativistic  wave 
equation  was  adopted  by  Dirac  (1928).  This  method  gave  a  first- 
order  wave  equation  and  consisted  in  linearizing  the  relation 
(17.1).  It  led  to  the  discovery  of  the  relativistic  wave  equation  for 
the  electron.  This  equation  plays  a  fundamental  role  in  relativistic 
quantum  mechanics  and  quantum  field  theory  since  it  provides  a 

^ee  P    A    M    Dirac,  Principles  of  Quantum  Mechanic?,  New  York    Oxford  University 
Press,   1958 

More  exactly,  we  in  fact  have  a  function  with  two  components  0j  ~  »/>  and  \f/2  —  ~~5r» 

cot 

since  a  second  derivative  with  respect  to  time  appears  in  the  fundamental  equation  One 
degree  of  freedom  corresponds  to  particles  with  positive  energy,  the  other  to  particles  with 
negative  energy  It  was  shown  by  Pauli  and  Weisskopf  that  the  negative  energy  states  can 
be  eliminated  by  carrying  out  a  second  quantization  of  the  scalar  equation  and  introducing 
spinless  particles  with  charges  of  opposite  signs 
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suitable  description  of  the  motion  of  particles  of  spin  1/2.  The 
discovery  of  this  equation  was  the  most  important  advance  in  the 
theory  of  the  electron  since  the  Maxwell-  Lorentz  equations  of 
classical  electrodynamics.  Bohr's  semiclassical  theory  and  non- 
relativistic  quantum  mechanics  served  only  as  transitional  theories. 
The  relativistic  relation  between  energy^  and  momentum  is 
1  inearpEjC&y,  .^extr^^ 

normal   with  .the  aid  of  matrices  >    For  this  purpose  we  represent 
"f  or  m~ 


where 

p()  =  m()c)  pi  =  pxt  Pi  =  Py,  PA  —  PZ-  (17.4) 

We  note  that 


To  determine  what  conditions  the  quantities  ajJL,  must  satisfy,  we 
square  both  sides  of  (17.3).  Then,  if  the  operators  p^  and  /V  com- 
mute, we 


P*'W  =  Y 

|JL    [J.1  JJ.       p.' 

Equation  (17.6)  coincides  with  (17.5)  only  if 

VV  +  V°V  =  2SiM^'»  (17.  7) 

that  is,  all  four  quantities  a;i  anticommute  with  one  another 

a(LV  +  V°V  =  0     ji^n,'  (17.8) 

and  the  square  of  each  of  them  is  equal  to  unity 

aj=l  (17.9) 

We    recall   that   the  2x2  Pauli  matrices  also  possess  analogous 
properties 


f  These  operators  commute  with  each  other  if  there  is  no  electromagnetic  field  There- 
fore, Dirac  proposed  that  one  should  first  extract  the  square  root  of  the  operator  for  a  free 
particle,  and  then  generalize  the  resulting  equation  to  the  case  when  fields  are  present 
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since  they  anticommute  [see  (16.40]  and  the  square  of  each  is  equal 
to  unity  [see  (16.39)].  To  extract  the  square  root  of  the  four-term 
polynomial,  however,  it  is  necessary  to  have  four  relations  (17.7) 
(fj,  =  0,  1,  2,  3),  instead  of  three  [Eqs.  (16.39)  and  (16.40)]  that  are 
satisfied  by  the  Pauli  matrices. 

Accordingly,  Dirac  proposed  that  we  take  a  system  of  4  x  4 
matrices  °n  and  p«  that  are  related  to  the  2  x  2  matrices  by  the 
expressions 

(i7.il) 

(17.12) 

where  a'n  are  the  Pauli  matrices 


Hence   we  find,  for  example, 

/O  1  0  0\ 

aj==(o  o  o  ?    andsoon- 

\0  0  1  O/ 

The  properties  of  these  matrices  are  similar  to  those  of  the 
Pauli  matrices,  as  may  be  easily  checked  by  direct  multiplication. 
In  particular,  it  turns  out  that  their  squares  are  equal  to  unity 

oi  =  P«  =  I  (17.13) 

or,  more  exactly,  are  equal  to  the  4x4  unit  matrix 

/I  0  0  Ov 

I==(o  0  1  Or  (17.14) 

\0  0  0  1 / 

As  in  the  case  of  Pauli  matrices,  we  have 

°i<32  —  —  32<*i  =  *3.i.     Pip2  =  —  P^Pi  =  ipa  and  so  on.        _  _  .  _v 
On(v  =  rv0(i    („.„'«!,  2i3>  (17'15) 

From  this  it  follows  that  the  different  matrices  a  anticommute  with 
one  another  (a  similar  conclusion  is  also  true  for  the  system  of  p 
matrices): 

The  matrices  an  and  Pn'f  however,  commute  with  each  other. 
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Dirac    proposed    that    the   matrices    o^     [see   Eq.    (17.3)]    be 
chosen  as  follows: 

an  =  p,on==:(0;^  („===!.   2.3),  (17.17) 

l\r    o"\ 


(17.18) 

In  conventional  notation  these  4x4  matrices  have  the  form 

/O  0  0  1\  /O      0  0  —i\ 

I/O  0  1  0\  /O      0  /       0\ 

ai""    0  1   0  0   '     a-'"~l  0    ~t  0       0   ' 

V.oooV  v,     oo     o/  g) 


v/ 


Multiplying  the  above  matrices  by  one  another,  it  is  easy  to  show 
that  they  satisfy  the  relations  (17.7). 

B.    THE  DIRAC  EQUATION.   CHARGE  DENSITY 
AND  CURRENT  DENSITY 

Let  us  substitute  the  corresponding  operators  into  the  linearized 
relativistic  relation  (17.3)  between  the  energy  and  momentum.  We 
obtain  the  Dirac  equation  for  a  free  particle4 

(E—H)t  =  0,  (17.20) 

where  the  operators  E  andp  are,  as  usual,  equal  to 

E  =  ift~,    p= 


Because   of  the  four  components   of  the  wave  function   i/>,   each  state  can  have  either 
positive  or  negative  energies  (see  below)  and  two  directions  of  the  spin  (see  Chapter  18) 
In    the   classical   case   the   relation   (17  1)   between   the   energy   and  momentum   can  be 
represented  in  a  form  similar  to  (17  20) 


This  equation  is  easily  verified  if  it  is  remembered  that  for  a  free  particle 


Consequently  the  matrix  «   >  v  c  must  play  the  role  of  the  velocity,  while  p3  ->\/l~  fi2 
a  scalar  that  characterizes  the  Lorentz  contraction 
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and  the  Hamiltonian  H  is  given  by 

H  =  f(a.p)  +  Pom0c2.  (17,21) 

Since  a  and  p3    are  4x4  matrices,-  the  wave  function  <|»  must  have 
four  components,  which  we  combine  to  form  a  single- column  matrix 


(£7-22) 


The    complex   conjugate    of  this    function    is  understood  to  be  the 
Hermitian  adjoint,  that  is,  the  row  matrix 

**=(*;*?«*:)•  (17-23> 

The  Dirac  wave  equation  (17.20)  is  therefore  equivalent  to  a 
system  of  four  equations 

(E  —  m0c2)  <h  —  c  (px  —  ipy)  ^4  —  q>*4>3  =  0, 

(E  —  m^)  <h  —  c  (pv  -j-  ipy)  4>3  +  cp^4  =  0,  (17.24) 

(E  -4-  m$*)  ^3  —  c(px  —  ipy)  <h  —  cp^,  =  0, 

(E  -f  mac2)  <|>4  —  c  (p,  -j-  *pv)  ti  +  cp  A  =  0.  ) 

In  the  case  of  motion  of  an  electron  in  the  electromagnetic  field 
specified  by  the  given  vector  and  scalar  potentials  A  and  a>,  we 
can  still  use  Eqs.  (17.20)  and  (17.  24),  but  the  energy  and  momentum 
operators  have  to  be  generalized  in  accordance  with  the  general 
laws  of  quantum  mechanics:  / 

F  =  iH  -£•  —  <&,     P  =  —  iflV  —  ^A  (17.25) 

The    complex  conjugate  of  the  wave  equation  may  also  be  repre- 
sented in  the  form  of  a  single  matrix  equation 

f-(F  —  c(o-P)  —  p3maca)  =  0,  (17.26) 


where   the   action   of  the   operators  iti  -*.   and  —  iflV   on  the    wave 

function  which  is  on  their  left  should  be  taken  to  be  the  same  as 
in  Eq.  (17.20)  but  with  opposite  sign,  that  is, 

_  ^ih  V  =  iHVp;     yih  -dT  =  —  ih  ~  <!>+.  (17.  27) 
For  a  free  particle,  Eqs.  (17.20)  and  (17.26)  now  become 

'*  W  *  +  icH  (a  '  V)  *  —  P;jm^*  =  °'  (17.28) 

'*       ^+icn  W"  '  °)  +  /vVP^  =  0-  (17.29) 
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Multiplying  (17.28)  on  the  left  by   <>+    and  (17.29)  on  the  right  by  <|> 
and  adding  them,  we  obtain 

7<J  *>  +  V-  fa*  =  0,  (17.30) 

which  may   be    interpreted   as   the   equation  of  continuity  for  the 
probability  density  p  and  the  current  density  j\ 


=0,  (17.31) 

where  s 

p  =  ef<|>,    j==a^a'l».  (17.32) 

If  we  write  the  last  equation  in  terms  of  components  of  the  wave 
functions,  rather  than  in  terms  of  matrices,  we  obtain 


(17'33) 


that  is,    PO  is  a  matrix  consisting  of  a  single  element  (it  is  just  a 
number  )0  In  exactly  the  same  way  it  is  readily  shown  that 

O  0  0 

J  J  J  J 

i  o  o  oy 

(17.33a) 

We  note  that,  contrary  to  the  Klein-Gordon  theory,  the  density 
p0  is  a  positively  defined  quantity.  This  does  not  mean,  however, 
that  in  the  Dirac  theory  p,,  can  be  considered  the  particle 
density.  Just  as  in  the  Klein-Gordon  theory,  there  will  be  particles 
with  a  sign  opposite  to  that  of  the  electrons  (positrons).  From 
Eq.  (17.32)  it  can  again  be  concluded  that  ca  should  be  regarded 
as  the  velocity  operator. 


*  Similar  relations  will  also  hold  in  the  rase  when  a  field  is  present 

In  second  quantisation,  the  definition  of  PQ  as  a  positive  quantity  means  only  that 
Fermi  statistics  should  be  applied  to  the  particles  (for  example,  in  the  case  of  the  Dirac 
equation),  if  pQ  may  take  cither  positive  or  negative  values  (for  example,  in  the  case  of 
the  Klein-Gordon  equation),  then  Bose  statistics  should  be  applied  to  the  particles 
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C.    TRANSFORMATION  PROPERTIES  OF  THE  WAVE 

FUNCTION  UNDER  LORENTZ  TRANSFORMATIONS 

AND  SPATIAL  ROTATIONS 

According  to  the  general  principles  of  the  special  theory  of 
relativity,  physical  laws  must  be  independent  of  the  choice  of  the 
Lorentz  frame  of  reference.  Therefore,  the  Maxwell  equations, 
the  Klein-Gordon  equation,  and  the  Dirac  equation  must  all  be 
invariant  under  the  Lorentz  transformations.  Let  us  investigate 
the  transformation  properties  of  the  Dirac  wave  function.  The 
Lorentz  transformations  can  be  written  as 


c/'~ctfcoshY —  Arsinlv,-,  x?  =  Xcosh'( —  ^  sinh  Y,  if  —  y,  z'~z,     (17.34) 
where 

coshT=--J_   --     sinhY  — --    ^        ,   $=--. 

V\—  BJ  '        1^1  —  3-'  f 


This    transformation   must    be    satisfied   by   all  four-dimensional 
vectors,  including,  in  particular,  the  charge  and  current  densities 

cp'  =  cpcosh  T  —  /vsinhY,     /;  =  /,.  cosh  7  —  cpsinhy,     j'vs  =  jVtZ  (17.35) 

The  definition  of  these  quantities,  according  to  the  Dirac  theory, 
gives 


—  a,  sinh  r)  '|>  =  <j/e~T*n|>, 
<|/*-a,<|/  =  ty+  (a,  cosh  Y  —  sinh  T)  ty  =  yw"^,  (17.36) 


Here  we  have  used  the  fact  that  e~  ^*»  =  cosh  Y?I  -  sinh  Y»I  =  cosh 
Y  —  a,  sinh  Y»  since  a-f"  =  1,  v]n±{=  yit  where  //  is  an  integer.  In 
order  to  satisfy  the  above  relations,  we  must  set 


<!/=(cosh    J  —  3»,sinh  J^  =  ^Tai  ^,f 

V  2j  (17.37) 


Then,  since 

-JU,          _J.ff.  __L71         JL«f 

2  „        2  2       !  2       f  /I  7    OQ\ 

a^  =^=(?  c/j,     7,2<?  =e        72 1  ^i/.ooj 

it   is    easy   to   show  that  the  relations  (17.36)  are  correct.  From 
(17.37)  it  can  be  seen  that  the  wave  functions  do  not  transform  as 
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vectors  (whole  angle  7)  or  as  tensors  (double  angles  7),  but  as 
spinors,  the  transformation  of  which  is  characterized  by  the  angles 
7/2.  jpinorg_frjfi  alfigugalled  tensors  of  rank  1/2. 

In  a  similar  m a nner",""" "If "IHay"" Be' ' 'aBowST'tflftTthe  transformation 
law  of  a  spinor  under  an  ordinary  spatial  rotation  (for  example, 
around  the  z  axis  by  the  angle  <p )  is  as  follows: 

./  I3s  ~2~  .  ,  r\.          ,  4.  ~~  *8  T*  /1 7    QQ\ 

tb  - — " C         u       6    rz= 'li  ^  *  ^JLi.u*// 

The  above  relations  follow  from  the  transformations  for  the  cur- 
rent vector: 

/'*  =  /*  cos  <p  +  /y  sin  <p, 

]'y  =  iy  cos  9  —  j\  sin  <p,  (17.40) 


these  transformations  are  represented  in  the  Dirac  theory  as 

f  l«itf  =  'f  (a»  cos  f  +  a2  sm  <p)  (j»,  (17.41) 

i[/la,r|/  =  <!)+a3t|)  and  so  on. 

By  substituting  the  values  for    <j>r  from  (17.39),  and  using  the  fact 
that 


,     .  •        • 

~-  +  ra3  sin      =  cos  -   —  ro.,  sin  ~  at  = 


-*,! 


=  e          a,,  a, 

we  can  obtain  the  relations  (17.40). 


Chapter  1« 

The  Dirac  Theory  of  the  Motion  of  an  Electron 
in  a  Central  Field  of  Force 


A.    ORBITAL,  SPIN  AND  TOTAL  ANGULAR  MOMENTA. 
CONSERVATION  LAWS 

We  shall  determine  the  angular  momentum  of  an  electron  from 
the  conservation  laws  characterizing  the  motion  of  an  electron  in 
a  central  field  of  force: 

V  =  e<b(r)  (18.1) 

(for  example,  an  electron  moving  in  the  Coulomb  field  of  a  nucleus 

Ze*\ 
V  _  --  ^  j  t   it  was  shown  in  the  nonrelativistic  Schrodinger  theory 

that  the  orbital  angular  momentum  is  conserved  in  a  central  field 

L=rxp.  (18.2) 

In  the  Dirac  theory,  however,  which  takes  into  account  the  electron 
spin,  the  component  of  the  orbital  angular  momentum  does  not 
commute  with  the  Hamiltonian 


V(r)  (18.3) 

and  therefore  it  is  not  a  constant  of  the  motion.  Indeed 

a*,)  *  0.  (18.4) 

In  order  to  generalize  the  law  of  conservation  of  angular  mo- 
mentum to  particles  having  spin,  we  shall  use  the  relation 


-«lPy).  (18.5) 

It  follows  from  this  that  the  operator 

~ftcr  =  L  +  S  (18.6) 
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commutes  with  the  Hamiltonian  operator  H  and  thus  is  a  constant 
of  the  motion. J 

This  result  may  be  interpreted  in  the  following  manner.  The 
electron  has  an  intrinsic  angular  momentum  (spin).  We  have  just 
found  that  only  the  total  angular  momentum  is  conserved  (the 
orbital  angular  momentum  plus  the  spin).  The  orbital  angular 
momentum  in  the  s  state  is  equal  to  zero,  and  therefore  we  have 
here  the  law  of  conservation  of  spin  angular  momentum.  For  the 
square  of  the  spin  we  obtain 

S2=4^(c>-Kl  !  «J)  =  s(S-H)ft'=Aft«;  (18.7) 

that    is,   the   electron   spin  takes  half- integral  values  5=  1/2  (in 
units  of  ft) 


B.  PROPERTIES  OF  THE  TOTAL  ANGULAR  MOMENTUM 

OPERATORS.  QUANTIZATION  OF  THE  TOTAL  ANGULAR 

MOMENTUM.  VECTOR  MODELS 

We  shall  now  show  that  the  operators  for  the  components  of  the 
total  angular  momentum  in  the  Dirac  theory  satisfy  the  same  com- 
mutation relations  as  the  operators  for  the  components  of  the 
orbital  angular  momentum  in  the  nonrelativistic  quantum  theory 
(see  Chapter  11).  It  can  be  seen  that  the  operators  L  and  S  com- 
mute with  each  other,  because  they  act  on  different  variables. 
Therefore, 


and 


JvJy  — 


(18.9) 


The  last  two  relations  are  obtained  from  the  first  by  cyclic  per- 
mutation of  the  coordinates 

>  z  ->x. 


Since  only  the  total  quantity  (18  6)  is  conserved,  the  separation  of  the  angular  mo- 
mentum into  spin  and  orbital  parts  is  not  rigorous  in  the  general  case  This  separation  is 
found  to  be  possible  only  in  certain  special  cases  (see  Chapter  20) 
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The  operator  of  the  square  of  the  total  angular  momentum  is  seen 
to  be 

P  =  (L  +  S)a  =  L3  +  S'J-l-2(LtS;e  +  L),Sv  +  LA).          (18.10) 

which  commutes  not  only  with  the  Hamiltonian  but  also  with  any  of 
its  components,  for  example,  with  the  z  component: 

J*J,  — JitJa  =  0  (18.11) 

By  analogy  to  the  orbital  and  spin  angular  moments,  we  conclude 
that  the  square  of  the  total  angular  momentum  and  one  of  its  com- 
ponents (for  example,  Jz)  may  have  simultaneous  eigenvalues. 

The  quantization  rules  of  the  total  angular  momentum  may  be 
found  from  the  quantization  rules  of  its  orbital  component  (for  a 
spinless  particle): 

I*  ^/r7  (/+!),  (/  — 0,1,..  ),  L£  =  Hm,  (m=      I,  ....   -|-/)  (18.12) 
and  its  spin  component  (for  example,  for  t--  0) 

S*  =  aMs+l),  (s-=V*).     S,=  ft/Hi,     K  =  -U/i)         (18.13) 

This  problem  can  be  solved  exactly  in  general  form  in  the  Dirac 
theory.  For  the  sake  of  simplicity  we  shall,  however,  solve  it  in 
the  Pauli  approximation,  that  is ,  taking  into  account  the  spin  of 
the  particle  and  assuming  that  the  particle  itself  is  nonrelativistic. 
If  the  particle  moves  in  a  central  field  of  force,  the  components 
of  the  wave  function 


which  obeys  the  Pauli  equation  (see  Chapter  16),  can  be  related  by 
means  of  the  law  of  conservation  of  angular  momentum 


where  L=  r  x  p  is  the  orbital  angular  momentum  operator  and  or' 
are  the  two-component  Pauli  matrices.  We  shall  look  for  a  solution 
of  the  system  of  equations  (18.15)  in  the  form2 

V,==C1VFr~l,  il'^C.rr,  (18.16) 


2In  this  particular  choice  of  a  solution,  only  the  square  of  the  orbital  angular  momen- 
tum is  conserved,  not  its  projection  on  the  z  axis. 
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where    Y?    are  the  spherical  harmonics  (see  Chapter  11).  Then, 
since 


Eqs.  (18.15),  (18.12)  and  (18.13)  give 


or 


where 

q  =  i  (j  4- 1 )  —  /  (/  4.  i )  _  1 .  (18. 18a) 

Using  the  relations  (11.87)  and  (11.88),  we  have 

(18.19) 


<LA  +  *LV)  YT  =  -  ft  ]/</  +  1  -•-  «)  C-H  m)  Yf--          (18.20) 
Then,  taking  into  account  (18.16),  we  may  write  Eq.  (18.18)  as 

(</  -  „,  +  1  )  c,  +  V(7+  1  -  //oF+^j  c,  =  o, 


From  the  requirement  that  the  determinant  of  the  system  be  equal 
to  zero,  we  find  two  values  of  q  corresponding  to  the  two  possible 
types  of  solution 


^^  (18.22) 

1).  /  =  /-%.  ^ 


The  coefficients  C,  and  C2,  which  determine  the  relationship3  be- 
tween the  spherical  harmonics  in  the  sum  of  the  orbital  and  spin 
angular  momenta,  are  called  the  Clebsch-Gordan  coefficients. 


We  note  that  this  relationship  between  the  spherical  harmonics  is  obtained  only  in  the 
case  of  spin-orbit  interaction,  which  we  have  taken  into  account  with  the  ai*d  of  the  re- 
lations (18  IS)  If  there  is  no  interaction,  the  two  solutions  will  be  completely  independent 
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Using  also  the  normalization  condition  C;  +  Q=  1>  we  may  write 

the  first  type  of  solution  when/=/  +  -I,  /  =  0,  1,  .  .  .  .  ,  in  the  fol- 
lowing form: 

i 

F        2/1-1  _  _/ / /_i_l/.^ 

(18.24) 


For  /==/  —  !/4»  '=  If  2,  ...  (the  second  type  of  solution),  the  wave 
function  has  the  form 

1 /T— 'yw'+T  ym  - 1  I 

IJKJJT     «'       ^iV'    "",  (18.25) 

27+7    K£ 

where  Vl;)m  are  the  so-called  spherical  spinors.  The  orthonormality 
condition  for  the  spherical  spinors  can  be  written  as 

Y\»m  =  Ww*mm',  (18.26) 

where  /  =  /  +  l/2  corresponds  to  the  case  in  which  the  spin  and  orbital 

angular  momenta  are  parallel,  and  /  =  /  —  Vato  the  case  when  they 
are  antiparallel.  This  condition  follows  immediately  from  the  fact 
that  the  spherical  spinor  Y\J)^  is  a  single-row  matrix  and  from  the 
orthonormality  condition  for  spherical  harmonics.  The  spherical 
spinors  (18.24)  and  (18. 25)  are  spinor  generalizations  of  the  ordinary 
spherical  harmonics  (see  Chapter  11)  and  represent  the  angular  part 
of  the  solution  for  all  problems  involving  the  motion  of  a  particle  of 
half- integral  spin  in  a  central  field  of  force. 

Substituting  these  solutions  for  the  function  W  into  (18.15),  we 
find  that  the  component  Jz  of  the  total  angular  momentum  takes  the 

value  Jz  =  hmn  where  the  quantum  number  my  is  equal  to  /wy  =  m  —  ~. 
For  the  first  type  of  solution  (/'  =  /  +  ^)>  it  can  b6  seen  from  (18.24) 

that  it  ranges  from  —  /  (m/  =  —  l  —^  =  —  /)  to  /  -|-  1  ( mf  =  I  -|-  -5-  =  / ) » 
since  the  coefficient  at  the  function  Y?  ~  \  which  does  exist  for  the 
last  value  of  m,  vanishes.  In  exactly  the  same  way  the  number  m 

in    the    second   type   of   solution    (/  =  /—  ^)    ranges  from —/+ 1 

Thus  our  results  can  be  summarized  as  follows.    The  square 
of  the  total  angular  momentum  has  the  eigenvalues 


(18.26a) 
T'  '  =  °' 
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that  is,  it  is  quantized  similarly  to  the  orbital  angular  momentum, 
except  that  the  quantum  number  /,  which  is  called  the  total 
angular  momentum  quantum  number,  takes  half-integral  values. 4 
The  eigenvalues  of  the  component  of  the  angular  momentum  along 
any  axis  are  also  characterized  by  half- integral  quantum  numbers 


(18.27) 


From  the  relations  (18.6)  and  (18.7)  and  the  quantization  rules 
(18.26)  and  (18.27),  it  is  easy  to  obtain  quantization  rules  for  the 
scalar  products  L  •  S  and  J  •  S,  which  are  important  in  spectros- 
copy 


},  (18.28) 


and  by  analogy 


S  = 


(18.29) 


We  shall  consider  here  the  vector  model  of  the  addition  of  angular  moments.  In 
spite  of  the  lack  of  mathematical  rigor  of  this  model,  it  enables  us  to  resolve  a  number 
of  complicated  questions  and  often  gives  accurate  results. 
As  we  know,  the  orbital  angular  momentum  does  not  have 
a  specific  direction  in  space  in  quantum  mechanics.  The 
absolute  value  (square)  of  the  angular  momentum  and  one 
of  its  components,  for  example,  along  the  z  axis,  have, 
however,  simultaneous  definite  values.  These  facts  can  be 
represented  geometrically  by  an  angular  momentum  vector 
that  describes  a  cone  about  the  2  axis.  The  projection  of 
the  angular  momentum  on  the  z  axis  will  then  have  a  well- 
defined  value,  whereas  the  projections  on  the  x  and  y  axes 
remain  indeterminate.  These  arguments  apply  with  equal 
validity  to  the  spin  angular  momentum  since  it  has  the 
same  commutation  properties  as  the  orbital  angular  mo- 
mentum. The  spin  and  orbital  angular  momentum  vectors 
are  oriented  in  space  in  such  a  way  that  their  sum  forms  a 
vector  /  that  is  constant  in  magnitude.  Thus,  the  vectors 
L  and  S  do  not  have  arbitrary  directions;  they  precess  about 
J  like  two  coupled  gyroscopes. 

The  dimensionless  quantities  y*,  /*,  and  s*  are  drawn 
in  Fig.  18.1.  Each  of  the  vectors  /*  and  s*  is  defined  on 
the  surface  of  a  cone.  They  "precess"  around  J*  like  a 
coupled  system.  We  note  that,  according  to  Eq,  (18.26), 
the  addition  of  the  vector  /*  (/  =  0,  1,  2,  .  .  .)  and  the 
vector  s*  (S^H:'/*)  leads  to  &e  total  angular-momentum 
vector  J*  with  half-integral  values  of  the  total  angular  mo- 
mentum  quantum  number  j  =  I  ±  lf*. 


FIR.  18.1.  Addition  of  the 

spin    and    orbital    angular 

moments. 


The  number  /  is  also  called  the  internal  quantum  number.  This  number  was  introduced 
by  spectroscopists  before  the  discovery  of  spin  on  a  purely  empirical  basis.  It  expressed 
certain  internal  properties  ot  particles  that  were  still  unclear  at  that  stage. 

The  lack  of  mathematical  rigor  lies,  for  example,  in  the  fact  that  the  square  of  the 
vector  j  is  equal  to  }(j  \  1)  rather  than  to  y2 

From  the  standpoint  of  the  classical  theory,  this  coupling  can  be  interpreted  as  a 
coupling  of  the  orbital  motion  by  the  magnetic  field. 
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From  the  vector  model  we  can  quickly  find  a  number  of  quantities.  For  example,  we 
can  find  the  quantization  rule  of  the  angle  between  the  vectors  /*  and  s*.  From  the 
oblique  triangle  we  obtain 

cos  (t<s*)  =  -^J—  f7*2  —  /"  —  s*2},  (18.30) 

that  is, 


. 

l)s(sfl) 


C.  MOTION  IN  A  CENTRAL  FIELD  OF  FORCE  INCLUDING 
SPIN  EFFECTS.  THEORY  OF  THE  ROTATOR 

If  we  wish  to  investigate  the  motion  of  a  particle  in  a  central 
field  in  the  nonrelativistic  approximation  with  the  inclusion  of 
spin  effects,  we  must  use  spherical  spinors  Y(iJ}m  characterizing 
the  states  in  which  the  total  angular  momentum  (orbital  plus  spin) 
is  conserved,  instead  of  the  spherical  harmonics  YT  describing 
the  states  where  only  the  orbital  angular  momentum  is  conserved. 
Since  the  spherical  spinors  (in  the  nonrelativistic  approximation)  are 
composed  of  spherical  harmonics  having  the  same  quantum  number  /, 
we  obtain  the  same  radial  equation  as  fora  nonrelativistic  spinless 
particle,  that  is, 

=a!o.  (18.32) 


For  the  wave  functions  of  an  electron  in  a  central  field  we  obtain 


W\*n  =  RY\»m,  (18.33) 

where   the   spherical   spinor    Y\}]m    is   defined   by  the  expressions 
(18.24)  and  (18.25). 

In  particular,  for  the  rotator  we  may  set  r  =  a=  const  and  the 
radial  part  of  the  wave  function  /?=!.  It  is  seen  that  the  spin 
effects  in  the  nonrelativistic  approximation  do  not  give  any  ad- 
ditional terms  for  the  rotator  energy,  which  will  be  given  by  the 
same  expression  (12.7)  as  for  a  spinless  particle,  that  is, 


The  wave  function  will  be  given  by  the  spherical  spinor  Y}s]m\ 
therefore  we  must  find  the  selection  rules  for  the  quantum  num- 
bers /  ,  nij  and  y  .  These  selection  rules  hold  not  only  for  the 
rotator,  but  also  for  any  problem  of  motion  of  a  particle  in  a 
central  field  of  force  (for  example,  the  electron  in  the  hydrogen 
atom). 
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In  place   of   Eq.    (12.19),    from   which  the   selection  rules  for 
spinless  particles  were  established,  we  now  have 

(q)™'  =  jl  (V  {,'V)1  qY\']m  dQ9  (18.35) 

where  g  may  have  three  values 

q  =  z  =  cos  ft,     <7  =  #Jr*#  =  sinflc!  '*  (18.36) 


(for  simplicity  let  us  take  the  radius  of  the  rotator  equal  to  unity: 
a~l).  If  in  place  of  the  spherical  spinors  we  substitute  their 
values  from  (18.24)  and  (18.25),  the  matrix  element  (18.35)  be- 
comes 


=  D</;>  f  (YF'-1)*  qY?  ~  '  d'J  +  C"'»  <f  (Y?  ')*  </>?  dS2         (18.37) 


From  this  it  is  seen  that  the  two  integrals  in  (18.37)  agree  exactly 
with  the  integrals  in  Eqs.  (12.19)-(12.22).  The  selection  rules  for 
the  quantum  numbers  /  and  m  will  therefore  be  the  same  as  for 
a  spinless  rotator,  that  is, 


We  shall  now  find  the  selection  rules  for  the  quantum  numbers 
nij    and    /     Since     my    is    related   to   m  by  the  relation  my=w  —  y 

for    both   types  of  solutions,  the  selection  rules  of  wy  will  be  the 
same  as  for  m,  that  is, 


To  determine  the  selection  rules  for  /  let  us  consider  first  the 
case  in  which  the  transitions  occur  between  states  characterized 
by  the  same  type  of  solutions  (/'  —  /'  -  1  -  '/*  ~>  y  =  /  -|~  !/a  or  /'  =  ^  —  lli  -> 
/  =  /  —  !/a).  It  follows  from  (18.24)  and  (18.25)  that"  the  coefficients 
D</v>  and  QM  are  always  positive  and  therefore  such  transitions 
are  always  allowed.  In  this  case  the  possible  change  of;  must 
be  the  same  as  the  change  of  the  orbital  quantum  number  /,  that  is, 
A/  =  A/  =  =tl 

If  the  transitions  occur  between  states  characterized  by  different 
types  of  solution  (/  =  /'  +  Vi->y  =  /~V«  or  /  =  /'-1/i^/  =  /  +  1/*). 
then  by  taking  into  account  A/=itl,  we  obtain  three  possible 
values  for  A/  =  0,  +2,  -2.  Here,  however,  we  must  consider  the 
fact  that  the  coefficients  D1^)  and  CW  have  different  signs.  For 
A/  =  -±:2  the  two  terms  cancel  each  other,  so  that  this  transition 
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is  forbidden.  For  A/  =  0  the  difference  between  the  two  terms  is 
not  zero,  but  owing  to  the  fact  that  the  two  terms  occur  with 
different  signs,  the  intensity  of  the  radiation  will  be  weaker 
than  in  the  case  of  transitions  between  states  characterized  by 
the  same  type  of  solutions,  when  A/  =  ^t  1.  This  can  be  shown  with 
the  help  of  a  specific  example.  Let  us  suppose  that  the  initial 
state  is  /  =  /—  V*  and  the  final  state  is  /  =  /'  +  %.  We  shall 
consider  the  case  Aw  =  mf  —  w  =  0.  Then,  using  (12.22),  we  re- 
duce the  appropriate  matrix  element  (18.37)  to  the  form 


/,  m)], 
where  /'  =  /'-{-  Y*  and  /=/.  —  Ya-  Substituting  the  expressions 

DO'  -  <'  i-  vs.  y  =  /  -  1/8)  =  --^-^J  ,.,_. 

and 


from  (18.24)  and  (18.25),  and  the  expressions  for  A  and  B  from 
(12.22a),  we  find  that  the  coefficient  of  &/•./  +  !  vanishes;  that  is, 
the  transition  A/  =  —  2  is  forbidden.  At  the  same  time,  the  co- 
efficient of  8/'.  /_i  does  not  vanish,  that  is,  the  transition  A/  =  0 
is  allowed,  but  the  intensity  of  the  lines  is  weak  in  comparison 
with  A/  =  nh  1.  In  a  similar  manner  it  is  easy  to  show  that  the 
transition  A/  =  2  is  forbidden  not  only  for  q~z,  but  also  for 
(/  =  x.±iy  (Aw=;t:l). 

In  accordance  with  the  above  discussion,  the  selection  rules 
for  the  quantum  numbers  which  characterize  the  state  of  a  particle 
in  a  central  field  of  force,  when  the  spin  is  taken  into  account, 
have  the  form 


(18.39) 


1    (normal  intensity), 
(weaker  intensity). 


D.    PARITY  OF  A  STATE 

In  connection  with  the  formulation  of  the  law  of  conservation  of 
angular  momentum  for  Dirac  particles ,  we  shall  now  define  more 
clearly  the  meaning  of  the  quantum  numbers  /  and  /  in  the  Dirac 
theory  of  a  particle  in  a  central  field.  In  nonrelativistic  quantum 
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theory,  the  orbital  quantum  number  /  is  associated  with  the  square 
of  the  angular  momentum  L"  =  #*/(/  +  1),  which  is  a  constant  of 
the  motion;  therefore  in  both  Schrbdinger's  and  Pauli's  theory 
the  quantum  number  /  represents  a  quantity  that  is  constant  in 
time. 

In  the  Dirac  theory  the  orbital  angular  momentum  does  not 
commute  with  the  Hamiltonian,and  therefore  it  is  not  a  constant 
of  the  motion.  Consequently  the  quantum  number  /  has  only  an 
approximate  meaning  when  used  in  connection  with  the  law  of 
conservation  of  angular  momentum. 

It  turns  out,  however,  that  /characterizes  an  additional  property 
of  a  particle  in  quantum  theory,  namely,  the  parity  of  a  state.  By 
the  parity  of  a  state  we  mean  the  behavior  of  the  wave  function 
with  respect  to  space  inversion: 

*  =  -.*',    y==-y',    z  =  -~z'.  (18.40) 

The  parity  operator  is  defined  as  follows  7 

I*(r)  =  «K-r);  (18.41) 

that  is,  it  reverses  the  signs  of  the  space  coordinates.  The  eigen- 
values X  of  this  operator  may  be  found  by  applying  this  operator 
twice: 

Iai[»  =  X8<|>.  (18.42) 

This  double  application  of  the  parity  operator  leaves  the  co- 
ordinates unaltered.  From  (18.41)  and  (18.42)  it  follows  that 

X  =  rtl;  (18.43) 

that  is,  either  the  wave  functions  remain  unchanged  with  respect 
to  space  inversion  (even  functions,  X=  1),  or  they  reverse  their 
sign  (odd  functions,  X  =—  1). 

We  shall  now  find  the  quantities  which  determine  the  parity  of 
a  wave  function  in  a  central  field.  In  the  spherical  coordinate 
system  rt  D,  <p,  space  inversion  affects  only  the  angular  part 


|  (18.44) 

as  can  easily  be  seen  from  the  fact  that  the  sign  of  the  coordinates 
A*  =  rsin  frcos  <p,     r/  =  rsinftsin  cp,  e  =  rcosO  (18.45) 


This  operator  converts  a  right-handed  system  of  coordinates  into  a  left-handed  system, 
and  vice  versa    In  the  Dirac  theory  I0(r)  -  /030(— r). 
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changes.  The  radial  part  of  the  wave  function  remains  unchanged 
with  respect  to  space  inversion,  whereas  the  angular  part  changes 
in  accordance  with  the  relationship 


IK/m(a,  <p)  =  l7(ie  — ft,  w  +  <p)  =  constP/m(— -cos  d)*'m  <*  +  *>== 

(18.46) 


because 


Thus  the  parity  of  a  state  in  a  spherically  symmetric  field  is  deter- 
mined by  the  parity  of  the  number  / 

Furthermore,  it  can  be  seen  that  the  Hamiltonian  in  a  central 
field  remains  unchanged  under  space  inversion;  therefore,  the  in- 
version operator  I  and  the  Hamiltonian  operator  H  commute  with 
each  other.  It  follows  that  the  parity  of  a  state  is  a  constant  of  the 
motion,  since 

^I=^(HI  — IH)  =  0.  (18.47) 

The  law  of  conservation  of  parity  has  no  classical  analog,  unlike 
the  other  conservation  laws  (energy,  momentum,  angular  mo- 
mentum). Consequently,  in  nonrelativistic  wave  mechanics  the 
number  /  characterizes  two  constants  of  the  motion:  the  square 
of  the  angular  momentum  and  the  parity  of  a  state.  In  the  Dirac 
theory,  the  number  /  does  not  have  the  significance  of  the  square 
of  the  angular  momentum,  but  the  relationship  between  this  num- 
ber and  the  parity  of  a  state  is  preserved. 

We  shall  see  later  that  parity  plays  a  particularly  important 
role  in  the  physics  of  elementary  particles.  All  wave  functions— 
whether  for  one  or  more  particles— can  be  classified  as  functions 
of  odd  or  even  parity.  Dipole  transitions  can  occur  between  states 
with  different  parities.  In  the  case  of  two  or  more  particles,  the 
parity  depends  on  the  total  spin  of  the  system  and  also  on  the  type 
of  statistics  obeyed  by  the  particles.  These  concepts  will  be 
analyzed  in  some  more  detail  in  the  treatment  of  specific  examples. 


E.  SOLUTION  OF  THE  DIRAC  EQUATION 
FOR  A  FREE  PARTICLE 

Let  us  consider  the  mot  ion  of  a  free  particle  of  spin  1/2  and  con- 
stant momentum  such  as,  for  example,  an  electron.  Without  loss  of 
generality  we  may  take  the  z  axis  to  lie  along  the  direction  of 
momentum;  that  is,  in  Eq.  (17.20)  we  set 

P*  =  P,  =  0,    p,*0.  (18.48) 
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The  Dirac  equation  then  takes  the  form 

(ill  *  -I-  »•  tk  &  -  P»«VJ)  t  =0.  (18.49) 

We  shall  look  for  a  solution  of  this  equation  in  the  form 

<i>  =  -17;  be~  iltKt  +  '**,  (18.50) 

where  L3  is  the  normalization  volume,  and  the  wave  number  k  has 
the  same  value  as  in  the  nonrelativistic  Schrbdinger  theory, 
namely, 

*=2™(/i  =  0,   1,  2,  ..  ), 


and  the  4  x  4  matrix 

bl 


satisfies  the  normalization  condition 

4=1  (18.51) 


To  determine  the  quantity   e  and  the  coefficients  b^  we  use  the 
system  of  equations  (17.24),  setting 


We  then  obtain 

--  -  Ol 


Hence  we  find  two  values  for  the  quantity  e:  e=  1  (the  energy  of 
the  electron  is  positive)  and  s  =  —  1  (the  energy  of  the  electron 
is  negative);  while  we  obtain  four  values  for  the  matrix  6,  the 
components  of  which  satisfy  the  normalization  condition  (18.51). 
Two  of  the  values  of  the  matrix  b  refer  to  states  with  e  =  l: 
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and  two  of  them  refer  to  states  with  e  =  —  1 ' 


(18.54) 


that   is,   the   states    (18.53)   differ   from   the   states  (18.54)  in  the 
sign  of  the  energy. 

To  determine  the  physical  significance  of  the  states  6(/)with 
different  f=  1,  2,  3,  4,  let  us  find  the  projection  of  the  spin  on 
the  direction  of  motion,  that  is,  on  the  z  axis.  First  of  all,  we 
note  that,  since  Lz  =  xpv  — //pt  =  0  when  a  particle  is  moving 
along  the  z  axis,  the  projection  of  spin  on  the  z  axis  must  be  con- 
served. This  follows  directly  from  the  fact  that  the  matrix  a, 
commutes  with  the  Hamiltonian  in  (18.49). 

We    can   find   the   eigenvalues    of  this  operator    o3  by  applying 
it  to  the  spin  functions  b(l}.  We  have  then 


(1  000 
0-10  0 
0010 
0  00—1 


I__  I 
*,_  " ' 
K 


_    =  '  &">;      (18.55) 

?„     I  •<- 


that  is,  for  this  solution  the  eigenvalue  of  the  operator  !/2  a,  equals 
Vi  •  In  exactly  the  same  way  it  is  easy  to  show  that 

Io3^>==:--l^,    -J.o3&<a>  =!&<»>  and   lo^=»-lft(4). 

Thus,  the  four  possible  states  correspond  to  the  four  possible  com- 
binations of  the  sign  of  the  energy  and  the  spin  direction.  The  solu- 
tion bn}  corresponds  to  positive  energy  (e=  1)  and  the  projection  of 
this  spin  along  the  positive  z  direction  (s=  I).  In  a  similar  way  we 
have  e=  1,  s~  —  1  for  the  solution  //*'and  we  have  e  =  —  1,  .$=  1 
and  £~  —1,  s  =  —  1  f or  the  solutions 6(:t>andb(i),  respectively,  where 
s  is  double  the  projection  of  the  spin  in  the  direction  of  the  mo- 
mentum. 

In  the  nonrelativistic  limit  (v  **>  c),  the  wave  functions  <|>3  and^4 

will  be   of  the  order  of  -^~~  times  the  wave  functions  ^  and^ 
(t|)3^  — •{/,)  for  positive  energy  states  (s=  1).  For  states  with  nega- 
tive energies  [see  (18.54)],  on  the  contrary,  <h~     fy. 
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We  have  considered  the  special  case  of  motion  of  a  particle  along  the  z  axis.  This 
does  not  restrict,  however,  the  generality  of  the  investigation  of  the  general  motion  of  a 
particle.  Whenever  the  direction  of  momentum  is  characterized  by  spherical  angles  & 
and  9,  it  is  always  possible  to  choose  a  primed  coordinate  system  in  such  a  manner 
that  the  z1  axis  is  directed  along  the  momentum.  Then,  by  carrying  out  two  rotations, 
one  rotation  through  an  angle  d  around  the  y'  axis  directed  perpendicularly  to  the  22' 
plane,  the  other,  second  rotation  through  an  angle  <f  around  the  z  axis,  we  may  transform 
from  the  solution  in  the  primed  coordinate  system  (momentum  along  the  z'  axis)  to  the 
general  case  (direction  of  momentum  characterized  by  the  spherical  angles  &,  <p  X 

Using    the    fact    that    under    a  rotation   of  the  coordinate  system  the  wave  function 
changes  in  accordance  with  (17.39),  we  may  write  the  solution  for  this  general  case 


*  -I-  °-»a) 


(18.56) 


which  is  a  generalization  of  Eqs.  (18.50),  (18.53)  and  (18.54). 

Jgrgblem  18. 1.  3now  that  the  energy  and  momentum  operators  E  =  ill  ^  and  p  =  —  in  V, 
respectively ,Transform  like  a  four- vector  under  the  Lorentz  transformations: 


J2 

c 


where 


=  -7!^       P^=P;-  P.- 


?  =  ~,    E'^i/fcgry  t  and  so  on. 


Hint.    Use  the   Lorentz  coordinate   transformation  [Eq.    (17.34)]  and  change  to  new 
variables  in  the  process  of  differentiation. 

Problem  18.2.  prove   the  relativistic  invariance  of  the  scalar  equation  for  a  free 
particle.        •"""ili 

Hint.    First,  let  us  prove  the  invariance  of  the  operator  relation 

E'8  —  c2p'8  =E2— cV 

by  using  the  results  of  the  preceding  problem. 
-i*"-V 

igblem 1 JAJ Show   that   in   the   case   of  spatial  rotation  of  the  coordinate  system 
i  by  the  angle  0  ,  the  wave  function  transforms  according  to  the  relation 


Hint.  Use  the  method  which  leads  to  the  relation  (17.39). 


J^r°Wem  lft^  With  the  aid  of  the  vector  model  of  addition  of  angular  moments, 
fin<r~the  angles  between  J*  and  s*  and  between  ;*  and  /*  taking  into  account  the  geo- 
metric vector  addition  in  quantum  mechanics;  that  is,  find 

cos  (;*s*);     cos  (;*/*) 


These  problems  also  refer  to  the  material  in  Chapter  17 
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Hint.  Use  a  method  similar  to  the  one  that  led  to  Eqs.  (18.30)  and  (18.31X 

*~*     *^N 

roblem  18.5}  Show  that  the  wave  function    £=C<|>*,  where   C  =  /aap8,    and  fy*is 
njugate  (but  not  the  Hermiaan  adjoint,  that  is,  nott^)  of  the  Dirac  wave 
function  for  an  electron  with  negative  energy  satisfies  the  Dirac  equation  with  positive 
energy    and   opposite   (positive)  sign  of  the   charge,  that  is,  describes  the  motion  of  a 
positron  (a  charge  conjugate  transformationX 
Solution.  The  Dirac  equation  is 


for  the  complex  conjugate  of  this  equation,  we  may  write  (taking  into  account  the  fact  that 

a;  =  olt  a*  —  —  a.,,  a  j  =  a<f   pj  =  p,): 


We  note  that  the  complex  conjugate  ^*  differs  from  <],+  (the  Hermitian  conjugate):  namely, 

k*i 

I1 

whereas 


Let    us    substitute    <f  ==  /«»?»'!'*   into  the   complex  conjugate   of  the  Dirac  equation.  We 
then    find    that    $     satisfies    the    Dirac  equation  if  the  charge  e  Is  replaced  by  -  e. 


jh 

Since  the  state  fy  (r9  t )  =  e  <|>  (r)  is  treated  as  a  state  with  positive  energy  (E  =  |  E  \  )t 

and    the    state   fy*  (rtt)  =  e  <j>*  (r)    is    treated    as  a  state  with  negative  energy 

(£  —  — 1£  | ),    we    must    interpret    the    sign  of  the  energy  in  "fy  differently  than  in  ty\ 


Chapter  19 


The  Dirac  Equation  in  Approximate  Form 

In  many   problems   which  are  solved  by  the  Dirac  theory,  we 

(V  \fl 
—  ]  in  the 

final  results.  Therefore  we  may  immediately  write  the  Dirac  equa- 

/  v  \a 
tion  in  an  approximate  form,  retaining  quantities  of  the  order  of  (  —  j  . 

It  will  be  shown  below  that  the  role  of  both  the  relativistic  and  spin 
terms  is  clearly  displayed  in  this  approximation. 

Let  us  consider  the  motion  of  an  electron  with  positive  energy 
£"">()  in  an  electromagnetic  field  which  does  not  depend  on  time.  Ih 
this  case  we  may  replace  the  energy  operator  by  its  eigenvalue, 
separating  out  the  rest  mass  energy  mtoc': 

£->w0c'2  +  E.  (19.1) 

The  wave  equation  (17.20)  then  becomes 


(E  -  e-I.)    ;=c  (a'-P)     ;  ,  (2/v*  +  E  -  *»)    ;  =c  K-P)    ;  ,     (19.2) 

where  a'  stands  for  2x2  Pauli  matrices  [see  (16.18)],  and  P  = 
p  —  —A.  Equations  (19,2)  are  simply  a  different  form  of  the  exact 
Dirac  equation.  For  the  components  of  fy,  we  obtain  from  (19.2) 

(E  -  fid>)  i,  -  C  (P.v  -  iPJ  ^  -  cP&  -  0 

and  so  on.  This  equation  can  also  be  obtained  from  the  first  of  Eqs. 
(17.24),  if  we  substitute  into  it  both  (19.1)  and  (17.25). 

As  was  mentioned  in  the  preceding  section,  in  the  nonrelativistic 
limit   the   components  tyA   and  ^4   are  "small"  for  positive  energy 

states,  since  they  are  of  the  order  of  y  times  the  "large  functions 

£1  and  <|>a.  The  transition  to  the  approximate  Dirac  equation  consists 
in  eliminating  the  "small"  components  -^  and  'l»l  from  Eqs.  (19.2) 

/  V  \* 

and  retaining  terms  of  the  order  of  (  —  j  in  the  remaining  equations 
for  the  "large"  components  ^  and  fy.  Thus  from  (19.2)  we  obtain 
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First   of   all   we  change  from  the  four- component  wave  functions 
^i\ 
r  )  to  the  two-component  functions  f  \jr*  I  by  setting 


<19*4> 

where  N  is  the  normalization  coefficient.  This  coefficient  may  be 
determined  from  the  "  renormalization"  relationship 


;)  .  (19.5) 


Since  the  "small"  wave  functions    "!»»  and   ti  occur  as  squares  in 
(19.5),  we  may  set 


that  is,  in  (19.3)  we  neglect  second-order  terms  and  replace  P 
by  p.  This  change  of  operators  is  permissible  in  calculating  the 
normalization  coefficient  since  they  differ  from  one  another  by  a 
first-order  term  (inversely  proportional  to  the  velocity  of  light). 
When  applied  to  second-order  terms  (squares  of  the  "small" 
wave  functions),  this  term  of  P  gives  only  third-order  terms, 
which  we  discard.  Then,  substituting  Eqs.  (19.4)  and  (19.6)  into  the 
left-hand  side  of  (19.15)  and  using  the  equation1 


(or'-  a)(a'.  b)  =  (a  •  b)+  i'  •  («xft),  (19.7) 

which  holds  both  for  the  Pauli  and  Dirac  matrices,  we  find 


(19.8) 

From  this  we  obtain 


In  order  to  prove  this  equality,  we  may,  without  loss  of  generality,  assume  that  vector 
a  is  directed  along  the  z  axis  (ax  -  ay  -  0,  az  a),  and  that  vector  b  is  located  in  the  zx 
plane  (by  -  0)  Then  we  obtain 


(a'  -  a)(a'  •  b)    - 

a  •  b     \    i  \a  -  (b  *  <y')l    -    a  •  b    \     i[o'  •  (a  x  6)] 
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This    approximation   gives   us,    in   accordance  with  (19.3)  and 
(19.4), 


8m»c" 


Substituting  (19.9)  into  the  first  of  Eqs.  (19.2)  and  neglecting  terms 
of  the  order  of  (  v  /  c  )3  ,  we  find 


Using  the  relation  (19.7),  we  have 

(0'.  P)(o'.P)  =  PI-f  t[0-(P  *  P)]= 
=  P1-(^l(px>»)-f(^xp)])=P1- 

-PJ-T.°'-^)- 

where  //is  the  magnetic  field  since  the  operator  v   acts  only  on  the 
vector  potential  A,  and  not  on  <J>. 

In  exactly  the  same  way,  with  the  aid  of  the  relations 


',  p)  (E  -  eto)  (a'-  p)  =  (E  -  *<D)  p*  ~  iAe  (a'-  £)  (^  p)  == 


and 

-,P*     ^p^(£-^)  =  (E^^D)P2-l-2f-  £-p  -|-/^V^.       (19.13) 

where  £  =  —  v(^    is  the  electric  field  intensity,  we  may  reduce  the 
Dirac  equation  (19.10)  to  the  following  approximate  form: 


(19.14) 

The  left-hand  side  of  Eq.  (19.14)  describes  the  motion  of  a  particle 
with  a  nonrelativistic  velocity  in  a  stationary  electromagnetic 
field.  The  right-hand  side  of  (19.14)  contains  an  additional  inter- 
action energy  that  describes  the  relativistic  and  spin  corrections. 
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The  first  term  on  the  right-hand  side  of  (19.14) 

^  <19'15) 

takes  into  account  the  correction  due  to  the  relativistic  velocity  of 
the  particle.  A  similar  additional  energy  must  also  appear  in  the 
relativistic  Klein-Gordon  equation.  The  classical  analog  of  this 
term  will  be  obtained  if  the  relativistic  expression  for  the  Hamil- 
tonian  is  expanded  in  a  series,  retaining  terms  of  the  order  of 


The  second  term  on  the  right-hand  side  of  (19.14)  may  be  written 
as 

Kmag  =  —  ji-//.  (19.16) 

From  this  it  is  clear  that  the  quantity  JLI  =  ~?n    <r'  may  be  treated 

as  the  Dirac  magnetic  moment  of  an  electron,  which  appears  ex- 
plicitly in  the  nonrelativistic  approximation  only  through  this 
transition.  This  interaction  energy  turns  out  to  be  of  the  order  of 
v/c.  From  the  intrinsic  angular  momentum  [Eq.  (18.6)]  of  the 
electron 


S  =  -o',  (19.17) 

we   find  the  relationship  between   S  and  \i  that  is  required  by  ex- 
periment and  follows  automatically  from  the  Dirac  theory 


The  next  term  of  the  expansion  characterizes  the  so-called  spin- 
orbit  interaction 

*'  -(£x  P)l,  (19.19) 


which  describes  the  interaction  of  a  moving  magnetic  dipole  with 
an  electric  field. 

This  interaction  may  also  be  interpreted  from  the  classical  point  of  view  in  the  follow- 
ing way:  a  magnetic  dipole  moving  with  a  velocity  i>  (the  spatial  component  of  a  tensor 
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quantity)  acquires  an  additional  electric  moment  (space-time  component  of  the  same 
tensor  quantity) 

l»el  =  7«xl»=  ,,!ocpx|i'  (19.20) 

which  interacts  with  the  electric  field  of  the  nucleus.  This  additional  energy  of  inter- 
action is 

Vcl  =-  *.|lel   =  -..  -^  |a'.(£xp)]  (19.21) 

This  classical  expression  for  the  interaction  energy  is  twice  as  large  as  the  correspond- 
ing quantum  expression  [see  (19.19)].  Even  before  the  advent  of  the  Dirac  theory,  an 
attempt  was  made  to  explain  the  fine  structure  by  the  semiclassical  introduction  of  spin- 
orbit  interaction.  To  obtain  an  agreement  with  experiment.  Thomas  and  Frenkel  sug- 
gested that  we  substitute  the  coefficient  1/2  into  the  classical  expression  for  the  inter- 
action energy  (19.21).  This  interaction,  which  follows  automatically  from  the  Dlrac 
theory,  is  called  the  Thomas-Frenkel  correction. 

In  particular,  for  the  Coulomb  field  of  a  nucleus 

(19.22) 

The  interaction  between  the  moving  magnetic  dipole  and  the 
nucleus  according  to  (19.19)  becomes 

"-*=  --*&--  S'L'  (19-23) 

where  S=  n  o'/2  is  the  spin,  and  L~rxp  is  the  orbital  angular 
momentum. 

We  note  that  there  is  no  spin-orbit  interaction  for  an  atom  in 
the  s  state  since  the  orbital  angular  momentum  in  this  state 
vanishes.  Finally,  the  last  term  of  the  interaction,  which  in  the 
case  of  the  Coulomb  field  is  equal  to3 

<19-24> 


is    called  the   contact   interaction.    The  additional  energy  corre- 
sponding to  it 


=  f 


(19.25) 


2J.  I.  Frenkel,  Wave  Mechanics,  New  York  Oxford  University  Press,  1938,  Vol.  2, 
see  also  (16.11),  where  the  coefficient  1/2  appears  in  the  magnetic  moment  produced  by  a 
moving  charge. 

In  the  derivation  of  (19  24),  we  have  used  the  fact  that,  according  to  (4  78), 

V21    =   -47r8(r)   -   -  4778  (x)  S  (v)  8  (2)  (19  24a) 
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is  proportional  to  I  ^  (0)  |2,  and  it  will  differ  from  zero  only  for  the  5 
state  since,  according  to  (13.28a),  only  in  this  case  |*F(0)|*  ^  0. 
For  all  other  states  ('  ^  0)  this  square  of  the  wave  function  vanishes 
when  r  =  Q.  In  this  sense  the  contact  term  may  be  regarded  as 
the  spin-orbit  interaction  for  the  s  state.  We  can  see  now  that 
the  last  two  terms  in  the  interaction  energy  (19.14)  characterize 
the  spin  properties  of  an  electron. 

r~ — v 

Problem  19, 1J  Show  that  the  matrix  ca  is  the  velocity  operator,  and  that  In  the  case 
Pfi^""PS!rtlcle  a  is  not  a  constant  of  the  motion,  unlike  the  momentum  operator 
*#v) .  Explain  this  difference.  Determine  in  what  case  the  average  velocity 


=  \ 


yd  x ,  and  so  on, 

will  be  related  to  the  average  momentum  by  the  classical  relation 

v=c*^.  (19.26) 

Hint.    From  the  H am il Ionian  (17.21)  it  is  possible  to  obtain  the  velocity  operator 


It  can  also  be  shown  that  the  velocity  is  not  a  constant  of  the  motion,  since  o  ^  0  .  Con- 
sequently, if  for  a  given  k  we  take  a  linear  combination  of  positive  and  negative  energy 
states  [see  Eqs.  (18.53),  (18.54)]  there  will  exist  interference  terms  that  will  fluctuate 
with  time  (~*±2<c/rt).  As  a  result,  Ehrenfest's  theorems  will  hold  only  on  the  average 
in  the  Dirac  theory.  The  interference  terms  will  disappear  in  the  calculation  of  the 
average  value  of  the  momentum  operator. 

Equation  (19.26)  holds  only  if  the  states  with  positive  energies  are  retained  (e=  1). 


Chapter  20 

The  Fine  Structure  of  the  Spectra  of 
Hydrogen-like  Atoms 

A.  STATEMENT  OF  THE  PROBLEM. 

The  problem  of  the  motion  of  an  electron  in  a  hydrogen-like  atom 
(Kepler's  problem)  is  rightfully  considered  as  the  touchstone  of  all 
forms  of  quantum  theory.  There  are  two  main  reasons  for  this. 
First,  it  has  great  physical  significance,  since  the  problem  of 
motion  in  a  Coulomb  field  can  be  solved  exactly.  Second,  the 
results  may  be  compared  with  experiment  to  a  high  degree  of 
accuracy;  for  example,  the  emission  spectra  of  atoms  can  be 
observed  by  optical  and  microwave  spectres  copy. 

The  solution  of  the  problem  of  motion  of  an  electron  in  a 
Coulomb  field  of  a  nucleus  (hydrogen  atom)  on  the  basis  of  the 
Schrodinger  equation,  obtained  in  Chapter  13,  gives  an  expression 
for  the  energy 

£i  =  _*Jf,  (20.1) 

which  is  in  good  agreement  with  experimental  data.  This  ex- 
pression for  the  energy  may  be  taken  as  the  zero-order  approxi- 
mation. A  more  detailed  study  of  atomic  spectra  shows,  however, 
that  the  spectral  lines  have  a  fine  structure  which,  of  course,  must 
be  associated  with  the  detailed  structure  of  the  energy  levels.  The 
Schrodinger  theory  does  not  give  an  adequate  description  of  the 
regularities  frequently  occurring  in  spectra,  since  it  neglects  at 
least  two  important  facts:  the  relativistic  dependence  of  mass 
upon  velocity  and  the  spin  properties  of  the  electron.  Both  these 
facts,  as  we  already  know,  are  accounted  for  by  the  Dirac  theory, 
and  therefore  application  of  the  Dirac  equation  to  the  Kepler 
problem  gives  results  that  accurately  describe  the  multiplet 
structure  of  energy  levels. 

As  was  pointed  out,  the  Kepler  problem  can  be  solved  exactly 
in  the  Dirac  theory.  The  solution,  however,  requires  many 
tedious  calculations  (much  more  complicated  than  in  the  Schr'odinger 
theory,  because  in  this  case  we  have  not  one  but  four  equations). 
Moreover,  in  the  course  of  these  calculations  one  does  not  always 
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perceive  the  physical  meaning  of  the  results,  the  analysis  of  which 
is  of  primary  importance  to  us.  We  shall  therefore  use  a  more 
elementary  method,  based  on  the  approximate  equations  of  the 
preceding  section.  This  method  not  only  enables  us  to  obtain 
formulas  characterizing  the  fine  structure  up  to  terms  of  the 

(V    \2 
—  1  '      but    also   to   interpret  the    individual  terms  as 

manifestations  of  relativistic  or  spin  properties  of  the 
electron. 


B.    RELATIVISTIC  AND  SPIN  EFFECTS 

As  follows  from  Chapter  18  [see  (18.24)  and  (18.25)],  the  wave 
function  of  a  particle  obeying  the  Dirac  equation,  taking  into 
account  the  spin  properties,  is 

W  =  R«Ypm.  (20.2) 

Here  Rnl  is  the  radial  part  of  the  wave  function  and  Y\^  is  a  spher- 
ical spinor:  for  y  =  /-|-l/*the  spin  is  parallel  to  the  orbital  angular 
momentum  and  for  /  =  /  —  V*  it  is  antiparallel. 

Although  terms  of  the  order  of  (vt'cY  are  not  formally  accounted 
for  in  Eq.  (20.2),  the  relationship  between  the  spherical  harmonies 
in  the  spherical  spinor  that  determines  the  zero-order  approxima- 
tion of  the  wave  function  is  established  by  the  spin-orbit  interaction, 
which  is  of  the  order  of  (vjcY* 

The  spherical  spinor  can  therefore  be  used  only  when  the  atom 
is  not  subject  to  external  perturbing  forces  of  magnitude  greater 
than  those  involved  in  the  spin-orbit  interaction.  If  that  is  not  the 
case,  the  spin-orbit  coupling  will  be  disrupted  and  a  new  set  of 
premises  must  be  set  forth  in  order  to  establish  a  relationship 
between  the  spherical  functions. 

Spherical  spinors,  just  as  spherical  harmonics,  satisfy  the 
equation 

vi.^ja  =  -'('  +  »)^  (20.3) 

therefore,  taking  into  account  (11.17),  the  radial  function  in  (20.2) 
satisfies  the  same  equation  that  was  derived  in  the  nonrelativistic 
Schrbdinger  theory: 


1  There  are  several  cases  in  quantum  mechanics  in  which  a  small  interaction  energy 
enables  us  to  find  a  relation  between  the  coefficients  of  the  functions  in  the  zero-order 
approximation  We  have  already  encountered  a  similar  situation  in  the  treatment  of  the 
Stark  effect  (see  Chapter  14). 
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The  wave  function  (20.2)  completely  determines  the  selection 
rules  for  all  quantum  numbers.  The  selection  rules  for  the  quantum 
numbers  /,  /,  and  m;  are  given  by  formula  (18.39),  while  the  selec- 
tion rules  for  the  principal  quantum  number  n  will  evidently  be 
the  same  as  in  Schrbdinger's  theory  [see  (13,  48c)],  since  the  radial 
function  remains  unchanged.  Considering  all  this,  we  obtain  the 
following  selection  rules  for  a  theory  of  the  hydrogen-like  atom 
which  takes  into  account  spin  effects: 

A/  =  ±l,  A/  =  0,  itl,  Amy=0,  ±1   (AM  is  an  integer).   (20.4) 

As  for  the  expression  for  the  energy,  we  cannot  restrict  ourselves 
in  this  problem  to  its  nonrelativistic  value  (20.1),  since  the 
latter  does  not  determine  the  fine  structure  of  the  energy  levels. 

Knowing  the  zero-order  approximation  of  the  wave  function 
(20,2),  and  also  the  additional  perturbation  energy  describing  the 
relativistic  [see  (19.15)]  and  spin  [see  (19.23)  and  (19.24)]  effects, 
we  may  find  the  energy  levels  characterizing  the  multiplet  struc- 
ture of  the  spectrum. 

According  to  formula  (19.15),  the  relativistic  correction  to  the 
energy  levels  is 


=  __  J  (tfwj'-Jj--.  ww<Px.  (20.5) 

Since  in  the  present  case 


(20.6) 

we  see  that  this  additional  energy  will  be  independent  of  the  solid 
angles    tt,  <p;  that  is,  integrating  over  the  solid  angle  we  get 

(20.7) 

Then  the  additional  energy  characterizing  the  relativistic  effects 
is 


\prel  — *      f//7o 

aC  9»,   r*  IW« 


__ 

°  n  3\ 

(20.8) 


where  a  =  ~  gzl  Vn?  is  the  fine  structure  constant. 


'He 
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In  the  derivation  of  (20.8)  we  have  used  (13.29a) 


Equation  (20.8)  agrees  exactly  with  the  formula  for  the  rela- 
tivistic  energy,  which  was  calculated  in  identical  approximation  by 
means  of  the  relativistic  Klein-Gordon  equation  [see  (15.31)]. 

In  a  similar  manner,  with  the  aid  of  (19.23),  we  find  the  addi- 
tional energy  due  to  the  spin-orbit  interaction 


°-=  ^  (S  •  L)  (/-').  (20.9) 

Using  expression  (13.29a)  for  (r*) 


and  expression  (18.28)  for(S  •  L) 


S.L   =*   for   /-O, 
o     for   /  =  o, 


we  obtain  the  following  value2  for  the  energy  (20.9): 

^s-°-^-^n^w^r  (20-10> 

In  these  equations 


At  first  glance  it  may  seem  that  the  spin-orbit  interaction,  which  is  inversely  propor- 
tional to  the  third  power  of  the  distance,  cannot  give  a  stable  state.  This,  however,  is  not 
so.  At  small  distances  the  spin-orbit  interaction  behaves  just  like  the  relativistic  inter- 
action, that  is,  it  is  inversely  proportional  to  the  square  of  the  distance.  Indirect  proof  of 
this  is  the  fact  that  Aft8  °  differs  from  Ah'rel  only  by  a  numerical  factor  of  the  order  of 
unity 
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and  the  quantity 


Finally,  the  energy  cor  responding  to  the  contact  interaction,  accord- 
ing to  (19.24),  is  given  by 


where 

|V(0)p=/?i(o)y}ft+yjft.  (20.13) 

Furthermore,  considering  the  expression  for 


[see   (13.28a)],   and  using  the   fact  that  |K^|2=      when/  =  0  and 
/  =  V2.  we  find 


that  is, 

A£cont=/?ft^!8/0.3  (20.15) 

From  this  we  obtain  the  following  expression  for  the  additional 
energy  which  accounts  for  the  relativistic  effects,  the  spin-orbit 
and  contact  interactions: 

A£  =  A£  rel  {-A£s-°*-f-  A£cont  — 

<i_n*/p)  __  -j    1 
V.)(7+l)  /0J' 


Incidentally,  Eq.  (20.10)  for  the  contact  interaction  may  be  obtained  when  the  expres- 
sion for  the  spin-orbit  interaction  (Eq.  20.10)  is  allowed  to  go  to  the  limit  as  /  ~>  0,  if  we 
discard  the  factor  6/0  in  20.10.  Therefore,  many  authors  use  this  procedure  and  neglect 
the  contact  interaction  in  deriving  the  fine-structure  formula.  However,  the  agreement  be- 
tween the  two  formulas  is  accidental  since  for  the  a  states  the  numerator  of  Eq.  (20.10)  is 
always  zero,  while  the  denominator  vanishes  only  in  the  nonrelativistic  approximation.  In 
a  number  of  other  problems,  for  example,  an  atom  containing  several  electrons,  the  energy 
associated  with  contact  interaction  is  no  longer  a  limit  of  the  expression  for  the  spin-orbit 
interaction. 
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Substituting  here  the  value  of  q  from  (20.11),  we  have4 

(20.16) 


Therefore,  summing  both  results  [(20.1)  and  (20.16)],  we  obtain 
the  fine  structure  formula  for  the  spectrum  of  a  hydrogen-like  atom5 

(20.17) 


From  this  it  is  seen  that  the  splitting  of  the  levels  is  proportional 
to  the  square  of  the  fine  structure  constant. 


C.    THE  FINE  STRUCTURE  IN  THE  DIRAC  THEORY 

When  we  take  the  fine  structure  into  account  the  position  of  the 
energy  levels  in  the  hydrogen  atom  is  found  to  depend  also  on  the 
total  angular  momentum  quantum  number  /.  Therefore  the  terms 
will  be  denoted  in  the  following  manner: 

<20'18> 


From  this  formula  it  is  seen  that  the  fine  structure,  according 
to  the  Dirac  theory,  depends  only  on  the  principal  quantum  number 
n  and  the  total  angular  momentum  quantum  number  y.  In  contrast  to 
the  Klein-Gordon  theory,  it  is  independent  of  the  orbital  angular 
momentum  quantum  number  /  (up  to  terms  of  the  order  of  a*). 


4This  problem  can  be  solved  exactly  in  the  Dirac  theory.  We  then  obtain  a  closed  for- 
mula for  the  energy  levels,  in  this  formula,  the  first  expansion  term  (which  is  independ- 
ent of  a2)  gives  the  nonrelativistic  formula  (20.1).  The  second  term,  which  is  proportional 
to  a  ,  gives  the  additional  energy  (20.16).  The  third  expansion  term,  which  is  proportional 
to  a  m  this  approximation,  can  be  neglected,  since  it  is  smaller  than  the  so-called 
vacuum  corrections,  which  are  proportional  to  a  (see  Chapter  22). 

An  exact  solution  of  the  Dirac  equation  gives  the  following  generalization  of  Eq 
(15  30),  which  takes  into  account  the  relativistic  effects  in  the  case  where  spin  is  also 
present: 


r          zv        >* 

1 1 — — , - 1 

L     (n_,-y^VcMy$)2-zV)2J 


=   mflc2      1  . ~-~  -m0c2  (20  17a) 


Equation  (20  17)  may  be  obtained  from  (20  17a),  if  the  latter  is  expanded  in  a  series  and 
we  restrict  ourselves  to  the  first  two  terms 

Since  the  minimum  value  of  j  is  equal  to  1/2,  we  find  that  stable  motion  in  the  Coulomb 
field  of  a  point  nucleus,  according  to  the  Dirac  theory,  will  extend  to  Zcr  -  137,  whereas 
ui  the  Klein-Gordon  theory  it  was  limited  by  Zcr  -  -  •  137  [see  (15  33)1  Such  an  increase  of 
Zcr  is,  as  we  have  already  mentioned,  due  to  the  slight  compensation  of  the  relativistic 
effects  by  the  spin  effects 
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The  diagram  given  in  Fig.  20.1  shows  that  all  terms  are  doubly 
split,  since  to  each  value  of  /  there  correspond  two  values  of  /; 
for  example,  instead  of  a  single  term  2/>  (/=-!)  we  now  have  two 

terms    I2pi  .   and  2/j.  _,.    The  exceptions 
-  E/h--0  are    the    ,s   terms   (/  —  0),    for  which    j 

can  have  only  one  value  (j  =  *  ';).   Thus 

the  relativistic  and  spin  effects  some- 

what  reduce  but  do  not  split  the    s  terms 

(see  Fig.  20.1). 
'  The     degree     of    degeneracy    also 

changes  owing  to  splitting  of  the  energy 

—  —2pi/2  levels.     We    know    that   the   principal 

2s1/?t2pr/2  quantum  number  may  take  the  following 

values:    ;j~l,  2,  3,  4,....     The    orbital 
_  js  angular  momentum  quantum  number  / 

varies  from  /  =  0  (s1   state)  to  /=//—!. 

FIR.  20.  i.    Energy    hwoi  dia  The  total  angular  momentum  quantum 

gram    of   the   hydrogen   atom  number  /  takes  the  values/  —  /  r*  -  J/2  (/  -/.  0) 

and  j—i/^l  —  U)  and,  finally,  the  mag- 

netic quantum  number  mf  —  -  -  j,  .  .  .  ,  -}  -  /,  that  is,  for  a  given  /  there 
are  2/+1  half-  integral  values  of  mr  The  degree  of  degeneracy, 
which  is  characteristic  for  any  central  field  of  force  and  is  related 
to  the  equivalence  of  the  various  directions  in  space,  is  therefore 
equal  to  2/-|-  1  for  particles  with  a  spin  of  1/2  (we  remember  that 
for  spinless  particles  it  was  equal  to  21  \-  1).  in  contrast  to  the  rela- 
tivistic spinless  theory,  the  degeneracy  with  respect  to  /  is  still 
present  when  we  take  into  account  terms  of  the  order  of  a2  and 
even  the  following  expansion  terms  proportional  to  a4.  When  the 
finite  size  of  the  nucleus  is  taken  into  account,  the  degeneracy  with 
respect  to  /  is  removed.  We  note  incidentally  that  even  greater 
splitting  with  respect  to  /  is  due  to  vacuum  fluctuations  (see 
Chapter  22).  The  magnitude  of  the  splitting  of  spectral  lines  can 
be  determined  from  the  selection  rules  (20.4).  For  the  Lyman 
series  we  then  have  two  lines  (instead  of  a  single  one): 


(weak  line,  since  A/=°)  (20.19) 

U)U»=(l$i   ,)  —  (/!/?<  ,). 

The  Balmer  series  lines  are  split  as  follows: 

w«>=i2siJ)-(/i/i,,J), 
(i><*>  =  (2*i  J  —  (np*  ,), 
w«)=(2^/J)-.(wsl3), 
co»>=(2/?i;J)~  (nsi  J, 
io<*'  =(2/>.  ,)  —  (/;</,  .),  (20.20) 
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and,  finally,  the  transition  2pi  ,  ->nd*>,  is  forbidden,  because  in  this 
case  Ay  —  2.  If  the  degeneracy  with  respect  to  /  is  not  removed, 
the  lines  w(1)  and  wl4)  coincide,  since  the  initial  and  final  levels  have 
the  same  values  of  the  principal  quantum  number  /*  and  the  total 
angular  momentum  quantum  number  j.  In  a  similar  manner  we 
may  determine  the  splitting  of  all  other  lines.  The  lowest  split 
energy  level  corresponds  to  n  —  2.  Let  us  consider  in  greater 
detail  the  splitting  of  this  level  in  the  case  of  the  hydrogen  atom 
(Z  =  l),  which  is  the  one  most  carefully  investigated  experi- 
mentally. In  general,  the  n  =  2  level  would  be  split  into  three  sub- 
levels,  and,  according  to  our  theory,  two  of  these  sublevels  would 
combine: 


(20.21) 

The  transition  frequency  between  these  levels  is,  according  to  the 
Dirac  theory, 

Au)°  =  (2/)i/a  )  —  (2pv,  )  =  R  ft  ,  (20.22) 

which  is  about  1.095-  104  Mc.b  If  only  the  relativistic  effects  are 
taken  into  account  (Klein-Gordon  equation)  the  corresponding 
splitting  is  [see  (15.32)] 

AW  K-C   =  (2s)  -  (2/>)  =  {  ^  ;  (20.23) 

that  is,  the  frequency  is  almost  three  times  greater  than  the  one 
found  from  the  Dirac  theory.  Consequently,  the  spin  properties 
of  particles  somewhat  reduce  the  influence  of  the  relativistic 
effects. 

The  conclusions  of  Dirac'  s  theory  have  been  accurately  con- 
firmed by  experiment. 

It  is  interesting  to  note  that  the  fine  structure  of  the  spectrum 
of  the  hydrogen  atom  was  first  theoretically  calculated  by  Sommer- 
feld  who  applied  a  relativistic  Hamiltonian  to  the  steady  states 
of  the  Bohr  classical  theory.  Sommerfeld  obtained  [see  (2.61)] 


1  Me  -  10     sec      ,  that  is,  the  angular  frequency  (j),  expressed  in  sec      ,  is  connected 
with  the  frequency  v,  expressed  in  Me,  by  the  relation 
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the  following  expression  for  the  relativistic  theory  (20.22)  without 
taking  into  account  the  spin  effects: 

Aa)Somm-=(2S)-(2p)  =%'.  (20.24) 

Agreement  of  the  Sommerfeld  result  with  the  conclusion  of  Dirac's 
theory  was,  however,  only  accidental.  Sommerf eld's  theory  did 
not  take  into  account  the  spin  effects,  and  therefore  it  was  unable 
to  predict  the  splitting  of  the  rc  =  2  level  into  three  sublevels,  the 
presence  of  which  was  later  confirmed  experimentally. 


D.    EXPERIMENTAL  VERIFICATION  OF  THE  FINE 
STRUCTURE  THEORY 

The  major  accomplishment  of  Dirac's  theory  was  its  treatment 
of  the  fine  structure.  The  theory  was  in  good  agreement  with  the 
experimental  facts  and  was  able  to  explain  this  structure  as  a 
manifestation  of  the  relativistic  and  spin  effects  caused  by  the 
motion  of  the  electrons  within  the  atom.  However,  further  and 
more  detailed  studies  showed  divergencies  between  the  theory  and 
fact.  Thus,  special  attention  was  given  to  the  2si/2  and  2pi/2  levels 
which,  according  to  the  Dirac  theory  [see  (20.21)],  should  coincide 
in  a  hydrogen  atom.  Among  spectroscopists,  doubts  about  the 
validity  of  this  conclusion  were  expressed  as  early  as  1934.  How- 
ever, the  techniques  of  the  time  did  not  allow  greater  experimental 
accuracy,  and  the  discrepancy  between  the  theory  and  optical 
observations  (that  is,  the  splitting  of  the  levels)  being  small,  no 
great  attention  was  paid  to  it.  Better  experimental  data  on  this 
splitting  were  obtained  consider  ably  later,  when  microwave  spectro- 
scopic  techniques  were  used. 

The  microwave  spectroscopic  method  was  invented  and  rapidly 
developed  in  the  postwar  years  as  a  result  of  technical  progress  in 
microwave  engineering.7  Microwave  spectroscopy,  which  has  now 
developed  into  a  special  branch  of  physics,  gives  valuable  results 
when  used  in  the  investigation  of  nuclei,  atoms  and  molecules. 
Microwave  spectroscopic  methods  are  also  applied  to  the  physics 
of  solids  and  liquids.  In  1947  Lamb  and  Rutherford  employed  this 
method  to  studies  of  the  2si/a  and  2/?i/d,  levels,  making  use  of 
a  special  property  characteristic  of  the  2si7j  state.  This  state  is 


By  microwave  ultrahigh- frequency  radio  emission  we  mean  the  region  of  the  electro- 
magnetic spectrum  located  in  the  wavelength  range  from  millimeters  to  tens  of  centimeters 
(10°  -  10  Me).  Successful  application  of  microwave  spectroscopy  to  the  investigation  of 
atomic  spectra  is  due  to  the  fact  that  the  distances  between  the  components  of  the  levels 
split  by  the  relativistic,  spin  and  vacuum  effects  are  of  the  same  order  of  magnitude  as  the 
wavelengths  in  the  microwave  region. 
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metastable,  since  a  dipole  transition  from  the  2$1/a  state  to  the 
lower  lsi/a  state  is  forbidden  by  the  selection  rules  A/ =  0  [see 
Eq.  (20.4)].* 

Transition  from  the  metastable  state  may  be  associated  either 
with  the  emission  of  two  photons  (the  probability  of  such  a  transi- 
tion is  108  lower  than  that  for  the  allowed  transition),  or  with 
a  preliminary  transition  to  the  2p  level.  Lamb  and  Rutherford 
investigated  the  latter  type. 

Let  us  describe  the  general  features  of  their  experiment 
(see  Fig.  20.2).  A  beam  of  hydrogen  atoms  in  the  unexcited  Isi/, 
state  is  obtained  as  a  result  of  dissociation  of  molecular  hydrogen 
at  high  temperatures  (tungsten  furnace).  A  bombarding  beam 
of  electrons  then  excites  some  fraction  of  the  atoms  in  the  hot 
beam  (approximately  one  out  of  108)  to  the  metastable  state  2si/2. 
The  metastable  atoms,  unlike  the  unexcited  atoms,  readily  give 
up  their  energy  of  excitation  upon  striking  a  metallic  target.  In 
so  doing  they  remove  electrons  from  the  metal.  The  resulting 
current  is  measured  by  a  sensitive  galvanometer. 


Fig.  20.2.  Diagram  of  the  Lamb- Rutherford  ex- 
periments on  the  detection  of  the  splitting  of 
the  2s t/,  and  2py,  levels:  1)  tungsten  furnace 
emitting  a  beam  of  hydrogen  atoms;  2) beam  of 
electrons  exciting  the  hydrogen  atoms,  3) radio 
frequency  field;  4)target,  5)  galvanometer 

If  the  beam  of  metastable  atoms  is  subjected  in  transit  to  a 
perturbation  capable  of  causing  a  2s->2;?  transition,  then  the  atoms 
will  almost  instantaneously  pass  to  the  lsi/s  state  (prior  to  reaching 
the  target).  As  a  result,  the  current  reading  on  the  galvanometer 
is  lower. 

In  the  Lamb-Rutherford  experiment  such  transitions  were 
induced  by  microwave  radiation  (the  probability  of  the  corre- 
sponding spontaneous  transition,  proportional  to  w4,  is  vanishingly 
small  as  a  consequence  of  the  smallness  of  w);  a  strong  damping 


This   is   correct  for  a   dipole  transition,    but   calculation   shows   that  the  quadrupole 
transition  between  these  states  is  also  forbidden 
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action,  resulting  in  a  decrease  of  target  current,  was  observed  at 
some  frequency  cw.  This  o>  was  assumed  to  be  the  resonance 
frequency  which  causes  transitions  2si/Jt  ->2pi,,  or  2si/,  -+2p>/,  with 
a  subsequent  practically  instantaneous  transition  to  the  lsi/2  level; 
the  energy  difference  between  these  corresponds  to  fiw.  Thus, 
one  can  very  precisely  measure  the  relative  positions  of  the 
levels 

2si/a2/;i/a  and  2;;^  9. 

These   measurements    showed   that   the   level    2$i/2   is    shifted 
upwards    relative  to  the  level    2/>./.  by  approximately  one  tenth  of 

the  distance  between  the  doublet  levels  2/>i/,  — 2/h/,,  which  is  equal 

a 
to  y^  /?.    The    arrangement   of  the  levels  of  a  hydrogen-like  atom 

(n  =  2)  derived  from  the  Lamb-Rutherford  experiment  is  given 
in  Fig.  20.3.  The  disposition  of  these  levels  according  to  the 
Dirac  theory  is  given  for  comparison.  According  to  the  latest 
data,  the  shift  of  the  2si/,  level  is  approximately  1057.77  Me 
or,  in  wavelengths,  ~28  cm. 


OKI 


•?*> 


7/2 


Fig    20.3.    Splitting  of  energy   levels   in   the  hydrogen 
atom     rj)  experimental  data,   /))  according  to  the  Dirac 
theory    (neglecting   vacuum   effects)     The   frequencies 
of  the  corresponding  transitions    and    the    distances 
are  given  in  Me. 

This  apparently  negligible  discrepancy  between  theory  and  ex- 
periment led  to  remarkable  progress  in  theoretical  physics  and, 
in  particular,  in  quantum  electrodynamics.  This  subject  will  be 
considered  in  greater  detail  in  Chapter  22. 


In  the  experiments  of  Lamb  and  Rutherford,  the  frequency  of  the  microwave  radiation 
was  fixed  and  the  resonance  condition,  corresponding  to  the  difference  in  the  Zeeman  com- 
ponents between  the  states  2M/2  and  2pi/2  or  2p3/,  was  obtained  by  adjusting  the  magnetic 
field  &%*  Then,  extrapolating  the  results  to  the  case  o%"  =  0,  the  authors  found  the  level 
shift. 
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E.    ANOMALOUS  ZEEMAN  EFFECT 

The  complete  theory  of  the  Zeeman  effect  (both  normal  and 
anomalous)  must  be  based  on  the  Dirac  theory,  because  the  latter 
takes  into  account  both  the  relativistic  and  the  spin  corrections. 
Since  the  anomalous  Zeeman  effect  is  due  to  the  spin  effects  in 
the  atom,  neither  classical  theory  nor  SchrbdingerTs  wave  me- 
chanics was  able  to  give  a  satisfactory  explanation  of  the  Zee- 
man effect,  and  for  obvious  reasons. 

As  a  starting  point  of  the  theory  let  us  take  the  approximate 
Dirac  equation  (19.14),  in  which  these  effects  are  taken  into  account 


up  to  the  terms    (y-.   Let  the  magnetic  field  be  directed  along  the 

z    axis,   that  is,    HX—-IIV  =  Q,  H  ..  =  >'%'.    Then,  using  the  fact  that, 
according  to  (16.16), 


'  (20.25) 

we  reduce  Eq.  (19.14),  describing  the  motion  of  an  electron  in  the 
Coulomb  field  of  a  nucleus,  to  the  form 


;;H^^  <20-26) 

where  Krel,  Ks-°-  and   Vcont  are  given  by  Eqs.  (19.15),  (19.23)  and 
(19*24),  respectively.  Upon  averaging  of  these  terms 

x  (20.27) 


(20.28) 


we  obtain  the  fine  structure  formula  (20.16),  that  is,10 


When  a  magnetic  field  is  present,  we  obtain  on  the  right-hand 
side  of  Eq.  (20.26)  the  interaction 

(20.29) 


10Generally  speaking,  the  spin-orbit  interaction  Vs "°"  is  of  fundamental  importance  in 
this  case.  Since,  however,  the  relativistic  terms  are  of  the  same  order  as  the  spin-orbit 
interaction,  we  may  set 
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which  gives  the  following  value  for  the  additional  energy  of  the  atom: 

A/I  ™«=  lv5r  J  (*;«•;)  (-  i  |-  -f  a.;)  (  J;)  d1*.        (20.30) 

The  appearance  of  either  anomalous  (case  of  a  weak  magnetic  field) 
or  normal  (case  of  a  strong  magnetic  field)  Zeeman  effects  depends 
on  the  relative  proportion  between  the  additional  energies  on 
the  right-hand  side  of  (20,26). 

Let  us  assume  that  we  have  a  comparatively  weak  magnetic 
field,  whose  interaction  with  the  atomic  electrons  is  smaller  than 
the  relativistic  or  spin-orbit  interaction. 

Then  the  zero-order  approximation  will  be  expressed  by  the 
wave  functions  (20.2)  that  are  obtained  when  the  spin-  orbit  coupling 
is  retained. 

Substituting  these  functions  into  (20.30),  the  additional  energy 
becomes 


A£  mag=  ^&T  J  |  Rnf  |i  r*  dr  ,|  dQ  (r,;>r  t       t  £  +  ,;)  Y<».  (20.31) 

0 

In   (20.31)    we  should  note  that  the  integral  over   r  is  equal  to 
unity 


/?n,|V  <//•=!.  (20.32) 

i 

Substituting  in  place  of  the  spherical  spinors  their  values  from 
(18.24)  and  (18.25),  and  using  the  orthogonality  condition  for  the 
spherical  harmonics 


we   find   the   following  expression  for  the  additional  energy  when 

/  =  '  +  '/.: 

A£mag  _  [(t  _)_  m) 


In  exactly  the  same  way  when  /==/  —  '/,  we  obtain 
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Recalling  that  m,  =  m  —  V«»  the  last  two  expressions  may  be  written 
as  a  single  formula 


A£  maS  = 


(20.33) 


where 


°W  *• 

=  -£~  is  the  Larmor  frequency,  and  the  Lande  g-  factor  is 


(20.34) 


Thus,  in  the  case  of  the  anomalous  Zeeman  effect,  the  expression 
for  the  additional  energy  contains  the  Lande  g  factor,  which  in  the 
case  of  the  normal  Zeeman  effect  [see  (16.23)]  is  equal  to  unity.. 

The  additional  energy  (20.33)  does  not  lead  to  the  usual  triplet 
splitting  (normal  Zeeman  effect),  but  to  a  more  complex  splitting 
pattern  (anomalous  Zeeman  effect). 

In  view  of  the  fact  that  mj  can  assume  2/-f-l  different  values, 
each  level  in  the  case  of  the  anomalous  Zeeman  effect  is  split 
into  2/4 1  separate  sublevels;  that  is,  the  external  magnetic  field 
completely  removes  the  degeneracy,  which  is  present  even  in  the 
relativistic  theory  of  the  hydrogen  atom. 


3-2 


-1/2 


.S£L. 


-1/2 


Fig.  20.4.   Zeeman  effect:  a)  position  of  the  energy  levels 

in   the   absence   of  a   field,    h)  anomalous    Zeeman   effect, 

<•)  normal  Zeeman  effect 


To  obtain  the  splitting  pattern,  it  is  necessary  to  take  into 
account  the  value  of  the  Lande  g  factor  (g  —  2  for  the  si/,,  states, 
g  =  */8  for  the  ;>i/.,  states,  g  =  4/;,  for  the  P)/»  states,  and  so  on)  and 
also  the  selection  rules  for  the  magnetic  quantum  number  w7.  In 
particular,  when  Aw;=0,  the  emitted  components  are  polarized 
parallel  to  the  z  axis  (that  is,  parallel  to  the  magnetic  field),  and 
when  Am;  =  ±  1  the  components  are  polarized  perpendicular  to  the 
magnetic  field. 


328  RELATIV1STIC    QUANTUM    MECHANICS 

Equation  (20.33)  gives  us  the  following  value  for  the  frequency 
of  the  radiation: 

u>  =  o)()  -f  o  (g°m;  —  gm,),  (20.35) 

where  <»„  is  the  frequency  of  the  radiation  in  the  absence  of  a  mag- 
netic field  (  J/"  —  0);  #M  and  £  are  the  Lande  g  factors  of  the  initial 
and  final  states;  the  magnetic  quantum  number  m;  of  the  final  state 
may  take  three  values:  w,  -=  w;,  m1;  '  1. 

Figure  20.  4b  shows  the  splitting  of  the  spectral  levels  1'si  ,  and 
2'pi  ,  in  a  weak  magnetic  field,  the  Lannor  frequency  being  taken 
as  the  unit  of  the  splitting.  From  Fig.  20.4b  it  is  seen  that  in  this 
case  there  are  four,  and  not  three  (as  in  the  normal  Zeeman  effect) 
shifted  lines.  The  magnitude  of  the  displacement  is  given  by 
(20.35). 

In  the  case  of  a  weak  field  [according  to  (20.34)]  we  find 


Hence 


.K-Jr,.    x  =---;!"  (2°-36) 

Equation  (20.34)  for  the  I  ande  ^factor  is  applicable  to  the  hydrogen 
atom  and  to  atoms  having  a  single  valence  electron.  In  the  general 
case,  the  Lande  g  factor  becomes 


where   L,  S,  and  /  are  the  orbital,  spin  and  total  angular  momenta 
of  the  atoms  and 

J  =  \L±S\. 

In  particular,  for  elements  of  the  first  group  (/  =  /,  L  =  l,  5  =  !/«)» 
Eqs.  (20.37)  and  (20.34)  are  identically  the  same.  The  Lande  g 
factor  attains  its  maximum  value  for  s  states  (1  =  0,  j  =  s=l/t): 

gs  =  2  (20.38) 

For  atoms  with  two  electrons  in  the  outer  shell  (for  example, 
helium  atoms),  single  lines  (5  =  0,  J  =  L)  are  possible  along  with 
the  triplet  state  S=l.  For  the  single  lines  we  have  g=  1,  and, 
therefore,  in  this  case  spin  effects  should  be  of  no  importance! 
only  the  normal  Zeeman  effect  (that  is,  triplet  splitting)  should  be 
observed  in  either  a  weak  or  strong  field. 
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F.    STRONG  MAGNETIC  FIELDS.   PASCHEN-BACK 
EFFECT 

It  has  been  indicated  that  the  anomalous  Zeeman  effect  appears 
in  the  case  of  weak  fields,  when  the  external  magnetic  field  cannot 
disrupt  the  spin-orbit  coupling, 

Mathematically  this  means  that  AEmag  [see  (20.33)]  is  much 
smaller  than  the  natural  splitting  of  the  lines  AES*°'  given  by  Eq. 
(20.28) 


AEs-<v  -AE^ae.  (20.39) 

In  the  latter  case  we  first  solved  the  problem  by  taking  into  account 
the  spin-orbit  interaction;  this  establishes  a  relation  between 
the  spherical  harmonics  that  form  the  spherical  spinor  (18.24)  or 
(18.25);  then  we  found  an  additional  energy  that  leads  to  the 
anomalous  Zeeman  effect,  since  the  Lande  j»  factor  does  not  equal 
unity. 

In  the  case  of  strong  fields,  when  the  splitting  due  to  the  external 
magnetic  field  is  greater  than  that  due  to  the  spin-orbit  interaction 

AEroaJV-AE8-0-,  (20.39a) 

the  magnetic  field  "breaks"  the  spin-orbit  coupling  and  the  zero- 
order  approximation  solutions,  expressed  in  terms  of  spherical 
spinors  [see  (18.24  and  18.25)],  are  no  longer  true. 

In   this    case   we    may    neglect  the  interactions  V/rel,    Vs*0-  and 
l/cont  in  (20.26),  which,  when  (20.29)  is  taken  into  account,  becomes 


Using  the  fact  that  the  functions  vFj  and  W2  must  be  proportional  to 
the  spherical  harmonics  Yj"f  with  —i  ^-  yyi  =  mYyi>  we  find  two 
independent  equations  for  these  wave  functions: 


_  (20.41) 

from  which  it  is  evident  that  the  additional  energy  equals 


(20.42) 

and 

A  (&Ems)m*£=p^  (Am 
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that  is,  the  wave  function  ^i  corresponds  to  the  case  in  which  the 
electron  spin  is  directed  along  the  magnetic  field  (ms  =  */t),  and  the 
wave  function  lFa  corresponds  to  the  case  in  which  the  spin  direc- 
tion is  opposite  to  that  of  the  magnetic  field. 

If  we  choose  m  such  that  the  same  energy  value  is  obtained 
for  both  functions,  then  we  must  set  m1  =  m  —  1  for  the  functional  , 
and  m,1^=m-\-l  for  the  function  Wj. 

In  this  case  the  wave  functions 


(20.43) 

will  be  mutually  orthogonal,  so  that 

dr  ,  d<2  (Y?-  I)<  Yf  HI  =0.        (20.44) 


Since  the  interaction  between  the  atom  and  the  external  magnetic 
field  (20.29)  contains  only  the  matrix  a',,  which  does  not  couple  the 
wave  functions  llr,  and  W,f  transitions  from  the  state  with  m,=  Va  to 
the  state  with  ms^  —  V*  »  induced  by  this  interaction,  will  be  for- 
bidden in  this  case  and  hence  A/?z?  —  0 

Taking  into  account  this  circumstance,  and  also  the  selection 
rules  for  the  quantum  number  m  (A/7/  =  0,  :».!),  we  find  from  (20.42) 
an  expression  for  the  Zeeman  splitting  of  the  spectral  lines 

<\ui  =  oAm  =  Of  -±ot  (20.45) 

which  agrees  with  the  result  of  Schrodinger's  theory,  which  explains 
the  normal  Zeeman  effect  (triplet  splitting  of  the  spectral  lines). 

Thus,  in  strong  fields  (A£ma£^>  A£s-°-),  the  anomalous  effect 
becomes  the  normal  effect,  which  is  in  agreement  with  experi- 
mental data  (Paschen-Back  effect).  It  is  interesting  to  note  that 
the  passage  from  the  anomalous  Zeeman  effect  to  the  normal  effect 
can  be  illustrated  by  Fig.  20.4,  if  the  Lande  g  factor  is  set  equal  to 
unity  (see  case  c).  Then  the  splitting  will  be 


that  is,  we  obtain  three  components  of  the  split  line  instead  of  four. 
In  special  cases,  when  AESt°*<A£nia8  for  one  energy  level  and 
conversely  ££»•<>•>  AEmft*  for  ^e  0^er  ieveif  or  when  the  energies 
of  both  levels  are  of  the  same  order  of  magnitude,  the  Zeeman 
splitting  becomes  complex.  Since  these  are  all  special  cases,  we 
shall  not  elaborate  them  here. 
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roblem  20.  IJ  Investigate  the  dlamagnetism  and  paramagnetism  of  atoms  by  placing 
homogeneous  magnetic  field  (H±*=Hy*»QtH  ;  —  &%");  contrary  to  the 
investigation  of  the  Zeeman  effect,  keep  all  terms  containing  g%^as  well  as  terms  pro- 
portional to  <2%^  in  the  Hamiltonian  [see  (20.25)]. 

Indicate  the  atoms  in  which  diamagnetism  may  be  observed. 

Solution,    When  terms   portional   to  e^^are  taken  into  account,  we  have,  instead  of 
(20.25) 


Consequently,  on  the  right-hand  side  of  (20.26)  we  have  another  term 


which  in  conjunction  with  (20.33)  gives  the  following  expression  for  the  additional  energy 
of  an  electron  in  a  magnetic  field: 


Here  when  calculating  the  perturbation  energy  proportional  to  i2  -l-.V3,  we  have  used  the 
spherical  symmetry,  which  must  occur  in  the  zero-order  approximation  (,.iy  -=0),  and 
have  set 

» 

Hence  the  magnetic  moment  of  an  atom  in  a  magnetic  field  is 

atom          dA/imag  elWr*         param  ,      diam 

1        =--=----  = 


The  latter  relationship  is  a  generalization  of  a  familiar  equation  relating  A/,maS 
in  the  case  where  A/^mag  is  a  nonlinear  function  of  ,  ajtf\ 

The  diamagnetism  of  atoms  is  characterized  by  the  second  term  of  Eq.  (20.47),  which 
is  proportional  to  ^j/T 

The  magnetic  susceptibility  per  gram-atom  is 


where  A^  is  Avogadro's  number. 

The  quantity  /.diam  jg  never  zero  and  must  always  be  negative  (ra  j>0).  'Iherefore, 
the  diamagnetic  effect  must  occur  in  all  atoms. 

As  for  the  first  term  on  the  right-hand  side  of  Eq.  (20.47),  which  is  proportional  to 

my,  it  may  take  either  positive  or  negative  values,  since  my  ~  _«„  .j*  ,    i  ~~  ,   ...  ,  ±  j. 

In  a  state  of  thermodynamic  equilibrium,  however,  the  negative  values  for  trip  which 
give  a  smaller  value  for  the  energy,  will  be  preferred.  On  the  average  we  therefore 
obtain  a  positive  value  for  the  paramagnetic  susceptibility  l  > 

param_  [t^2_  /(;+  1) 
~~  kT          3 

Expression  (20.47)  is  obtained  for  a  weak  magnetic  field,  when  the  anomalous  Zeeman 
effect  occurs.  It  can,  however,  be  easily  extended  to  the  case  of  a  strong  magnetic  field. 
To  do  this  we  must  set  ff  =  1,  m;  =  m  in  (20.47)  [see  (20.42)]. 
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Since  the  paramagnetic  susceptibility  is  considerably  larger  than  the  diamagnetic 
susceptibility  (XParam  >  /  diam)t  the  atoms  exhibit  paramagnetic  properties  when  j  -+.  0. 
For  hydroger-like  atoms  ;  differs  from  zero  (the  minimum  value  of  /  equals  l/s)  and, 
therefore,  they  are  always  paramagnetic.  Only  for  atoms  with  an  even  number  of 
electrons  can  the  quantum  number  /  vanish  (for  example,  parahelium  in  the  ground  state, 
see  Chapter  24).  Such  an  atom  will  be  diamagnetic. 


Adding  geometrically  the  orbital  and  spin  angular  momenta,  show  that 
eeman  effect  is  associated  with  the  fact  that  the  total  magnetic  moment 

|i  is   not  parallel  to  the  total  angular 
momentum  J.    With  the  aid  of  the  geo- 
metric model,  also  explain  the  Paschen- 
>V     /  -.  Back  effect. 

^   w  \  Solution.    First  let  us  find  geometri- 

cally the  angular  and  the  magnetic  mo- 
WH  ment  vectors 


Fig    20  5    Geometric   interpretation  of  the 
anomalous  Zeernan  efiect 


In  the  geometric  representation  (see 
Fig.  20.5)  'we  may  choose  the  scale  so 

that  u  —  —  ^~—    (it  is   immaterial  at 
'  2rn0c 

the    moment   whether    the   vectors    are 
parallel  or  antiparallel). 
Then 

|i,=-L,  and  |i  =  2S 


that  is,  the  total  magnetic  moment  ji  will  undergo  two  rotations  in  a  magnetic  field:  one 
with  an  angular  velocity  <•>  around  the  direction  of  the  total  angular  momentum  (this 
angular  velocity  corresponds  to  the  frequency  associated  with  the  transition  between 
components  of  the  spin-orbit  splitting  of  the  spectral  lines  <o  -  J\<*~),  and  the  other  corre- 
sponding to  the  Larmor  frequency  of  precession  around  the  direction  of  the  magnetic 
field  H  (//,  =  //„  =  0,  II s  =  ,jn- 

When  (D  >  o,  the  additional  energy  should  be  calculated  from  the  relation 


J3 


Since  the  magnetic  moment  is  directed  on  the  average  along  J,  we  have 


f   W 
where  o  =  -"        ,  and  the  Lande  g  factor  is  equal  to 


I  S 

g  =  -j.  cos  (L  »y)  +  2  -j  cos  (5- 


Substituting  the  values  of  the  cosines  of  the  angles 


2JL 


and  remembering  that  J3—  WJ  (J  ±  1),  and  so  on,  we  obtain  the  expression  (20. 37)  for 
the  Lande'  g  factor.  Let  us  pay  attention  to  the  fact  that  if  the  vectors  |i  and  J  were 
parallel,  the  Lande'  g  factor  would  be  unity, 
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In  strong  fields  o  >  <o  we  must  consider  independently  the  rotation  of  the  orbital  and 
the  spin  moments  ji  around  the  z  axis.  Then  the  additional  energy  becomes 


=  -  ji.^  =^|™  (U  +  2SJ  =  ofc  (m  +  1), 
which  leads  directly  to  the  normal  Zeeman  splitting  (Paschen-Back  effect). 

/Problem  20.3^  By  means  of  the  relativisuc  scalar  wave  equation  and  the  Dirac  equa- 
tion, MIR!  'HIS 'frequency  of  the  allowed  transitions  between  the  n  ™-  2  and«  =  3  states. 
Show  that,  according  to  the  Dirac  theory t  there  are  seven  lines,  five  of  which  are  distinct, 
and  that,  according  to  the  scalar  theory,  there  are  only  three  distinct  lines. 


Chapter  21 

The  Effect  of  Nuclear  Structure 
on  Atomic  Spectra 

A.    INTRODUCTORY  REMARKS 

As  has  been  mentioned  in  Chapter  13t  the  position  of  the  spectral  lines  is  shifted  when 
the  finiteness  of  the  nuclear  mass  is  taken  into  account.  The  Rydberg  constant  /?  in  the 
expression  for  the  energy  of  a  hydrogen-like  atom 


(21.1) 

n  \  *•  > 

is  somewhat  reduced  and  becomes  equal  to 

*  =  *„(!  -$)  PI.2) 

where 


is  the  Rydberg  constant  corresponding  to  infinite  nuclear  mass.  Consequently,  the 
Rydberg  constant  will  have  somewhat  different  values  for  hydrogen,  deuterium,  and 
tritium.  With  the  great  accuracy  attainable  in  modern  spectroscopic  techniques,  this 
effect  can  be  used  to  detect  the  presence  of  different  isotopes  (see  Chapter  13). 

In  a  similar  fashion,  the  finite  size  and  the  magnetic  moment  of  the  atomic  nucleus 
have  certain  effects  on  atomic  spectra, 

B.    EFFECT  OF  THE  FINITE  SIZE  OF  THE  NUCLEUS 

When  the  motion  of  an  electron  in  the  field  of  a  nucleus  is  treated  as  a  problem  in 
classical  theory,  it  is  quite  immaterial  whether  the  nucleus  is  regarded  as  a  point  or  as 
a  particle  with  finite  dimensions.  All  that  matters  is  that  the  electron  should  at  all 
times  be  outside  the  nucleus  and  that  the  nuclear  charge  should  be  spherically  sym- 
metric, since  the  potential  outside  a  spherically  symmetric  charge  distribution  is  the 
same  as  the  potential  of  a  point  charge. 

In  quantum  mechanics  the  situation  is  somewhat  different.  The  wave  function  must 
differ  from  zero  inside  the  nucleus;  therefore,  there  is  a  certain  probability  (however 
small)  that  the  electron  will  be  located  inside  the  nucleus.  Consequently,  the  charge 
distribution  inside  the  nucleus  must  in  some  manner  influence  the  energy  levels  of  the 
electrons  in  the  atom. 

To  estimate  the  effect  of  the  finite  nuclear  size  on  the  energy  spectrum  of  a  hydrogen- 
like  atom,  we  shall  assume  that  the  nucleus  can  be  represented  by  a  sphere  of  radius 
y?jV  with  charge  distributed  uniformly  throughout  the  volume.  The  potential  energy  will 
be  given  by  (see  Fig.  2U1) 
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=  -  for 


(2U) 


The  shift  of  energy  levels  due  to  the  finite  size  of  the  nucleus  can  be  calculated  with 
the  help  of  perturbation  theory.  We  shall  assume  that  the  perturbation  energy  consists 
of  the  difference  between  the  potential  energy  of  a  point  nucleus  and  the  potential  energy 
of  a  nucleus  with  charge  uniformly  distributed  over  the  nuclear  volume 


71-iH 


(21.5) 


with 


1     for    r  <  RNt 
0     for    r>/?Ar. 


The  perturbing  force  therefore  differs  from  zero  only  inside  the  nucleus. 


(21.6) 


Fig.  21.1  Potential  energy  with  the 
finite  dimensions  of  the  nucleus  taken 
into  account  The  dashed  curve  shows 
the  variation  of  the  potential  energy 
which  would  be  obtained  inside  the 
nucleus,  if  the  potential  were  de- 
scribed by  the  Coulomb  law  both  inside 
and  outside  the  nucleus 


In  first-order  perturbation  theory  the  shift  of  the  levels  is  given  by 
AEvol  —  I  <|,*vol  tyd*Xt 

Since  |  ty  Is  does  not  change  appreciably  in  the  region  r  ^  /?,  this  integral  can  be  readily 
evaluated  by  substituting  for  |  ^  j"  its  value  at  the  origin. 


(2J.8) 
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Substituting  the  expression  (20. 14)  for  |  fy  (0)  |3 ,  we  obtain 


that  is,  in  nonrelativistic  theory,  the  shift  of  the  energy  levels  is  different  from  zero  only 
for  s  states  (/  =  0;. 

IRj\r\* 
It  can  be   shown  that  for  p  levels  this  shift  will  contain  a  factor  of  the  order  I  -  -  I 

a  /A*   \ 4  \an/ 

(where  art  —  a0~  ),  and  for  d  states  a  factor  of  the  order  of  I  ^j  ,  and  so  on.1  Con- 
sequently, the  shift  in  the  energy  levels  for  p  and  d  states  can  be  neglected  in  the  first 
approximation. 

For  hydrogen,  the  first-order  shift  in  the  energy  levels  is  about  1  Me;  this  is  much 
too  small  to  account  for  the  Lamb  shift,  which  is  equal  to  approximately  1,057  Me  (see 
Chapter  20X 

The  volume  of  the  nucleus  is  important  in  connection  with  the  isotope  shift,  that  is, 
the  shift  in  the  energy  levels  of  atoms  with  the  same  atomic  number  Z  and  different 
mass  numbers  A.  The  chief  factors  that  give  rise  to  the  isotope  shift  are  the  different 
masses  (the  mass  effect)  and  different  volumes  (the  volume  effect)  of  the  isotopes.  The 
mass  effect  is  manifested  in  a  shift  of  the  spectrum  lines  towards  the  ultraviolet  as  the 
mass  number  A  increases.  For  example,  for  Z  —  1  the  highest  frequencies  are  found 
in  tritium  with  A  =  3.  then  deuterium  with  A  =  2",  and,  finally,  ordinary  hydrogen  with 
A  =  1  [see  (21.2)  and'  also  Chapter  13].  On  the  other  hand,  the  volume  eliect  is  mani- 
fested in  a  shift  of  the  spectrum  lines  towards  the  infrared  as  A  increases.  For  instance, 
it  can  be  seen  from  Eqs.  (21.1),  (21.9)  and  (21.13)  [see  below],  that  the  energy  levels  of  a 
hydrogen-like  atom  will  be  given  by  the  following  expression  when  the  shift  due  to  the 
volume  effect  is  taken  into  account: 


RZ*hl         4   Z-/?8    Vi\ 
n*~V~~5   1/T"  /• 


Experiment  shows  that  an  isotope  shift  towards  ultraviolet  is  observed  in  elements  whose 
atomic  number  Z  is  less  than  40-50.  For  elements  with  a  larger  value  of  /  an  Isotope 
shift  in  the  opposite  direction  is  observed,  that  is,  towards  the  infrared.  This  indicates 
that,  for  relatively  light  elements,  the  isotope  shift  is  caused  mainly  by  the  mass  effect, 
whereas  for  heavier  elements  it  is  caused  by  the  volume  effect.  This,  however,  is  a 
rather  simplified  picture  of  the  isotope  effect,  and  other  features  associated  with  the 
structures  of  the  atom  (for  example,  nuclear  spin  and  polarization  of  nuclei  by  electrons) 
also  have  to  be  taken  into  account. 

C.    MESIC  ATOMS 

The  finite  size  of  the  nucleus  has  a  particularly  important  effect  on  the  position  of 
energy  levels  in  a  meslc  atom— an  atomic  system  consisting  of  a  u  meson  revolving  about 
a  nucleus.  The  u  meson  is  a  particle  that  has  the  same  spin  as  the  electron  (that  is,  spin 


To  calculate  the  shift  of  the  p  levels  we  must  substitute  the  second  term  m  the  ex- 
pansion of  |0|(r)  into  (21  7),  namely, 

2 

dr      f  -  o  ' 
since  the  main  term  |i^(0)|2  vanishes  for  p  states 
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1/2  in  units  of  ft)  and  a  mass  207  times  greater  (m|X  =  207/H0),  so  that  the  }i  meson  is 
basically  a  "heavy"  electron.  Me  sic  atoms  can  be  produced  by  passing  negative  JLI  mesons 
through  matter.  After  losing  its  energy  and  slowing  down,  a  p,  meson  may  be  captured 
In  an  orbit  about  a  nucleus,  forming  in  this  way  a  ji-mesic  atom.  Mesic  atoms  have 
been  obtained  for  almost  all  elements  of  the  periodic  system,  from  hydrogen  up  to 
the  heavy  elements  (uranium,  neptunium,  and  so  on).2 

The  motion  of  a  fi  meson  about  the  nucleus  is  determined  mainly  by  the  Coulomb 
attraction,  just  like  the  motion  of  an  electron.  A  u  meson,  however,  also  has  nonelectro- 
magnetic  interactions  with  the  electron—  neutrino  and  nuclear  fields;  these  interactions 
may  result  in  spontaneous  decay  of  the  p,  meson  into  an  electron,  neutrino,  and  antineutrino 
(the  lifetime  of  a  jz  meson  at  rest  is  T  =  2.2  x  10"°  sec).  The  p.  meson  has,  In  addition, 
a  definite  probability  of  being  captured  by  a  nucleus.  Ihus  the  lifetime  of  a  u-mesic 
atom  is  determined  by  two  competing  processes:  natural  decay  of  a  JLI  meson  into  an 
electron,  neutrino,  and  antineutrino  and  nuclear  capture  of  the  ji  meson.  In  light  mesic 
atoms  (  Z  <:  10)  the  probability  of  the  first  process  is  greater  than  the  probability  of 
the  second;  that  is^  the  lifetime  of  a  mesic  atom  is  determined  by  the  lifetime  of  a 
ji  meson  at  rest  (T^10~°  secX  For  Z>10,  nuclear  capture  begins  to  predominate 
and  the  lifetime  slowly  decreases  to  ^  7  -  10~8  sec  (for  Z  =  81>).3 

In  the  theory  of  }i  -mesic  atoms,  ordinary  electrostatic  interaction  plays  a  funda- 
mental role.  In  the  first  approximation,  we  can  regard  the  nucleus  as  a  point  charge 
and  calculate  the  energy  of  the  mesic  atom  and  the  radius  of  the  orbit  using  the  equations 
derived  for  an  ordinary  hydrogen-like  atom,  replacing  the  electronic  mass  by  the  mass 
of  a  p,  meson.  Then  the  energy  and  radius  of  the  orbit  will  be  given  by  [see  (13.33), 
(13.45)] 


where  tn^  =  207///0  .  It  can  be  seen  that  the  energy  of  the  u  meson  in  a  mesic  atom  is 
207  times  greater  than  the  corresponding  energy  of  an  electron  in  the  atom,  and  that  the 
radius  of  the  orbit  on  the  contrary  is  reduced  by  the  same  factor.  If  electrons  remain 
in  the  atom  along  with  the  p.  mesons,  they  will  move  about  the  nucleus  in  considerably 
larger  orbits  than  the  p.  meson  and  therefore  cannot  exert  a  significant  Influence  on  the 
p,  meson  rotating  around  the  nucleus.  A  mesic  atom,  therefore,  may  be  regarded  as  a 
hydrogen-like  atom  that  can  have  both  large  and  small  values  of  Z. 

Since  the  radius  of  the  "Bohr"  orbit  of  the  meson  is  207  times  smaller  than  that  of 
the  electron  orbit,  the  probability  that  the  meson  will  be  located  in  the  nucleus  is  con- 
siderably greater  than  for  an  electron  in  a  hydrogen-like  atom.  The  main  correction  to 
the  energy  levels  of  a  mesic  atom  will  therefore  come  from  the  volume  effect.  The 
equation  for  the  energy  of  s  states  can  be  obtained  from  (21.9)  and  (21.10)  by  replacing 
the  Bohr  radius  GO  by  the  corresponding  radius  in  a  mesic  atom 

a».  =  ~>>  (2UD 


obtaining,  therefore, 


2These  subjects  are  treated  in  more  detail  in  a  paper  by  D  D  Ivanenko  and  G  E  Pus- 
tovalova,  llspekhi  fiztcheskikh  nauk,  61,  27  (1957) 

3For  77  mesons,  which  strongly  interact  with  nuclei  and  are  responsible  for  nuclear 
forces,  the  lifetime  with  respect  to  decay  into  a  £t  meson  and  neutrino  is  equal  to  2  6  X 
10~8  sec,  whereas  the  lifetime  with  respect  to  capture  by  a  nucleus  is  many  times  smaller 
In  particular,  for  Tr-mesic  hydrogen  the  capture  time  of  a  negative  pion  from  the  orbit  is  of 
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Using  the  fact  that  the  nuclear  radius  is  related  to  the  mass  number  A^  2Z  by  the 
expression 


(21.  13) 

where  RQ  =  const,  we  obtain  in  the  first  approximation 


,  - 

,  i-o-  —  , 


*•  yi 

v/  J  ' 


It  follows  thatt  for  the  s  levels,  the  energy  correction  due  to  the  nuclear  volume  will 
be  proportional  to  Z*'*  •  and  therefore  attains  very  large  values  for  heavy  elements. 

In  heavy  elements,  the  orbit  of  a  ji  meson  may  even  be  inside  the  nucleus,  at  large 
Z  and  small  n.  rlhe  Bohr  radius  of  the  mesic  atom  becomes  equal  to  the  nuclear  radius 
forZ  —  Zcr  -45 

For  orbits  inside  the  nucleus  (Z  >  Zcr)  9  the  main  part  of  the  potential  will  no  longer 
be  determined  by  the  Coulomb  law  (21.4)  but  instead  by  formula  (21.3)  which  corresponds 
to  the  potential  of  a  three-dimensional  harmonic  oscillator  (on  the  assumption  of  a 
simplified  model  of  the  nucleus  in  which  the  charge  is  uniformly  distributed  over  the 
volume).  Thus  the  energy  has  to  be  determined  from  the  following  equation  instead  of 
(13.4): 

if'/?    ,  2   dR       ?/*,. 

dr*  ^  r    dr  ^  ft-   x 

/          3^jj        1  Ze^       1(1  \-U1P\n      n  (21.15) 

Xr+2/?7      '2  7^7™      tm-r*-)^0- 

In  Eq.  (21.15)  let  us  change  the  variable  by  setting  R=Y  r^'.  r=  V  P"-     Since 


it  follows  that  the  equation  for  the  energy  of  a  three-dimensional  harmonic  oscillator  is 
formally  identical  with  the  equation  for  the  hydrogen-like  atom 


but  has  different  values  of  the  constants,  namelyt 


. 
4ft2 

=  /'C'  +  l),  (21.17) 


To  determine  the  eigenvalues  of  the  energy  we  may  use  Hq.  (13.20),  according  to  which 


Substituting  for  Bt  A  and  /'their  values  from  (21*17)  and  using  the  fact  that  k  —  n  -  /—I, 
we  can  find  the  energy  of  the  meson  in  an  orbit  inside  the  nucleus: 

H  /+  8  s),  (21.18) 
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where  the  frequency  of  mechanical  vibrations  of  the  three-dimensional  oscillator  is 


The  quantity  8/2ft(o  represents  the  zero-point  energy  of  the  three-dimensional  oscillator; 
it  is  three  times  greater  than  the  corresponding  zero-point  energy  of  a  one-dimensional 

oscillator.  The  quantity  V0=  v  >>  -  is  the  greatest  depth  of  the  oscillator  potential  well. 

2   «N 

Equation  (21,18)  is  correct  on  the  assumption  that  the  potential  energy  varies  in  ac- 
cordance with  (21.3)  from  zero  to  infinity.  If  the  finite  size  of  the  nucleus  is  taken  into 
account,  an  additional  energy  is  obtained  that  represents  the  difference  between  the 
potential  energy  of  the  particle  in  the  oscillator  well  and  its  potential  energy  in  the 
Coulomb  field  (this  difference  being  averaged  over  the  space  outside  the  nucleus  X  The 
equation  for  the  additional  energy  is 


AE  vol  =J    w  .? 

(21.19) 

Since  the  wave  function  of  a  spherical  oscillator  is  similar  to  the  wave  function  for  a 
Coulomb  field  (It  is  determined  from  the  same  wave  equation  with  r  replaced  by  P  =  r9), 
it  decreases  exponentially  as  r  increases.  The 
correction  (21.19)  is  therefore  significant  only         _ _ ______________  £$, 

when  the  radius  rn  of  the  me  sic  atoms  is  close 
to  RN. 

As  an  example,  let  us  consider  the  ?p  ~ *  Is 
transition  in  lead  (Z  =  82).  If  the  mesic  atom 
of  lead  is  assumed  to  have  a  point  nucleus,  the 
energy  released  in  this  transition  can  be  found  .  ..  fQ 

from  (21.10)  '*'/* 

1m  Zsf*  ^'^    ^'^     Ene»"gy    levels    in    a    mesic 

E3  —  /;4  r=  '—  £-1--  =  M  Mev.      (21.20)  atom 

The  relativistic  and  spin  effects  in  the  ?/J1/t  — *  lst/i  transition  increase  this  energy  by 
the  amount  2  J 

A  (A/;)  ^2  Mev.  (21.21) 

Incidentally,  such  a  significant  role  of  the  relativistic  and  spin  effects  is  due  to  the  fact 
that  the  energy  is  expanded  in  terms  of  (Zot)s.  This  quantity  is  comparatively  large  for 
lead  (Z=  82,  a=  1/137).  Comparison  with  experiment  shows,  however,  that  in  this 
transition  an  energy  of  6  Mev  is  liberated  instead  of  the  predicted  16  Mev.  This  dis- 
crepancy between  theory  and  experiment  arises  because  the  1  s  state  for  lead  lies  inside 
the  nucleus.  In  the  {>p  state  the  orbit  lies  outside  the  nucleus  and  the  volume  effect  of 
the  nucleus  is  small.  If  we  take  the  energy  of  the  Is  level  from  (21.18),  and  the  energy 
of  the  2p  level  from  (21.10)  for  a  point  nucleus,  the  energy  of  the  2p  -  -  Is  transition 
will  be  3.6  Mev.  If  we  add  to  this  the  correction  (21.19)  for  the  energy  of  the  Is  level, 
the  energy  of  the  transition  will  come  to  about  5  Mev,  which  is  relatively  close  to  the 
experimental  value. 

A  study  of  the  multiplet  structure  of  the  '}p  level  in  mesic  atoms  enables  us  to 
determine  the  spin  of  a  ji  meson.  For  a  particle  with  integral  spin,  the  level  splits 
into  an  odd  number  of  components  (for  spin  0  no  splitting  occurs,  for  spin  1  three  lines 
are  observed,  etc.).  Since  the  2p  level  splits  into  two  components  (2/?i/a  and  2p*/a  ),  it 
was  established  that  the  spin  of  a  p,  meson  is  1/2.  On  the  assumption  of  a  point  nucleus, 
the  theoretical  splitting  calculated  from  Eq.  (20.17)  should  amount  to  about  0.55  Mev, 
with  the  'Ipif  and  2si/3  levels  coinciding.  When  the  finite  size  of  the  nucleus  is  taken 
into  account  in  a  mesic  lead  atom,  the  splitting  of  the  2/?«/3  —  2pi/^  levels  is  reduced 
to  0.2  Mev  and  the  2s,x  level  is  raised  above  the  2/?8/  level.  This  is  illustrated  in 
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Fig.    21.2.    From   the  above  data  It  follows  that  heavy  me  sic  atoms  will  emit  gamma 
quanta  having  energies  of  several  Mev.  Lighter  mesic  atoms  emit  x-rays. 

Because  of  the  significant  influence  of  the  size  of  the  nucleus  on  the  spectra  of  heavy 
mesic  atoms,  the  charge  distribution  inside  the  nucleus  can  be  determined  from  an 
analysis  of  the  spectra.  It  has  been  found  that  the  value  that  should  be  substituted  for 
/Vo  in  the  formula  for  the  electromagnetic  radius  of  a  nucleus  with  mass  number  A: 

KN*=R0A1'*  (21.22) 

is    1,  2  x  10  -13  cm   rather  than    1.4  x  10-13  cm   (the  value  assumed  for  nuclear  inter- 
actions), 

Similarly,  the  multiplet  structure  of  the  spectral  lines  of  mesic  atoms  can  be  used  to 
determine  the  magnetic  moment  of  the  ji  meson;  its  value  is  close  to  the  muon  magneton 


where  the  Lande  g  factor  is  g  —  1. 

The  theory  of  JLI  -mesic  atoms  is  based  mainly  on  electromagnetic  interactions.  By 
contrast,  in  the  theory  of  TT  -mesic  atoms,  a  great  part  is  played  by  the  nuclear  inter- 
actions, the  theory  of  which  is  far  from  complete.  Further  experimental  study  of 
TT  -mesic  atoms  and  an  explanation  of  the  semiempirical  laws  that  describe  their  be- 
havior will  have  important  bearings  on  future  work  in  the  theory  of  nuclear  forces. 
These  topics,  however,  lie  beyond  the  scope  of  this  textbook. 

D.  APPLICATION  OF  THE  DIRAC  EQUATION 
TO  THE  NEUTRON  AND  THE  PROTON 

The  Dirac  equation  describes  the  motion  of  particles  with  spin  1/2.  It  applies  to 
electrons  as  well  as  to  protons  and  neutrons.  In  the  presence  of  an  electromagnetic  field 
it  is  necessary  to  take  into  account  the  charge  of  the  proton,  as  well  as  the  so-called 
anomalous  magnetic  moments  of  the  proton  and  the  neutron.  We  recall  that  the  energy 
of  interaction  between  a  charged  Dirac  particle  and  an  electromagnetic  field  is 

Ve  =  e<l>—e  **A  .  (21.24) 

In  the  nonrelativistic  approximation  this  expression  contains  a  magnetic  moment  due  to 
the  intrinsic  (spin)  angular  momentum  (h  '2o) 


This   quantity   is    known  as  the  kinematic  or  Dirac  magnetic  moment.  In  passing  to  the 
relativlstic  equation,  we  must  replace  the  mass  ///,,in  Eq.  (21.25)  by  Its  relativistic  value 

/W0 

-  3,  and,  therefore,  the  Uirac  magnetic  moment  vanishes  as  the  velocity  approaches 

the  velocity  of  light   (v  ^  c). 

In  addition  to  the  Dirac  magnetic  moment,  which  appears  only  in  the  nonrelativistic 
approximation  and  which  depends  on  the  charge,  a  particle  may  hfve  an  anomalous 
magnetic  moment  that  does  not  vanish  even  in  the  relativlstic  case  and  is  independent 
of  the  particle's  charge. 

We  shall  now  find  the  energy  of  interaction  due  to  the  anomalous  magnetic  moment. 
The  energy  of  interaction  (21.24)  of  an  electron  with  an  electromagnetic  field  is  a 
scalar  quantity,  since  in  four-dimensional  space  i»l»  =  ^l1,  A^—Ai,  Vy—  /lj,  A?  —  <48. 
In  the  same  wav  the  unit  matrix  I  is  the  fourth  component  of  the  velocity  matrix  a  ^ 
(that  is,  a4  =  /l).4  The  interaction  energy  (21,24)  may,  therefore,  be  represented  as  a 
scalar  quantity  in  four-dimensional  notation 


4More  precisely,  the  quantity  }^  =  en/i^a^i//  [see  (17  32)1,  where  a-  ^1,2, 3,^  =  '!»  Wlil 
transform  as  a  four-vector 
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The  electromagnetic  field  forms  an  antisymmetric  tensor  of  second  rank 

dA        ^V 

"»•-§-«£•  <21-27> 

where 


It  follows  that 

//,=//„,     //v=//,i,    //,=  / 


* 


The  interaction  energy  of  the  anomalous  magnetic  moment  with  the  electromagnetic  field 
is,  therefore,  given  by 

i 
KOT=n    >]    V//[JlV,  (21.29) 

JX,    V-^l 

where  a[lv  is  a  second  rank  tensor  composed  of  the  Dirac  matrices. s 

Using  the  rules  for  transformation  of  a  wave  function  under  the  Lorentz  [see  (17.38)] 
and  spatial  rotations  [see  (17.39)],  we  can  show  that  the  quantities 


are  the  matrix  elements  forming  a  second  rank  tensor.  The  energy  of  interaction  between 
the  anomalous  magnetic  moment  and  the  electromagnetic  field  takes  the  form 

l'm==Mp3'-//-kpa*-£l  (21.31) 

An  electron  has  a  charge,  a  spin,  and  also  a  Dirac  magnetic  moment.  Its  anomalous 
magnetic  moment  is  relatively  small  (see  below).  A  neutron  has  no  charge,  but  it  does 
have  an  anomalous  magnetic  moment;  this  magnetic  moment  determines  the  interaction 
between  the  neutron  and  the  electromagnetic  field.  As  for  the  proton,  it  has  both  a 
charge  and  a  spin,  and  hence  a  Dirac  magnetic  moment;  in  addition,  it  has  an  anomalous 
magnetic  moment.  It  should  be  noted  that  nuclear  interactions  are  of  great  importance 
in  the  theory  of  nucleons. 

E.  EXPERIMENTAL  DETERMINATION  OF  THE  MAGNETIC 
MOMENT  OF  THE  NEUTRON  AND  THE  PROTON 

The  procedure  for  determining  the  magnetic  moment  of  the  neutron,  proton,  and 
complex  nuclei  is  basically  the  same  as  for  the  magnetic  moment  of  the  electron  (the 
Stern-Gerlach  experimentX  The  basic  principle  consists  in  applying  a  magnetic  field 
perpendicular  to  the  direction  of  motion  of  the  particle.  The  particle  will  react  dif- 
ferently depending  on  whether  its  magnetic  moment  is  oriented  parallel  or  antiparallel 
to  the  field. 

Let  us  first  consider  the  possibility  of  determining  the  Dirac  magnetic  moment 
and  the  anomalous  magnetic  moment  of  a  free  particle.  Suppose  a  free  particle  moves 
perpendicularly  to  the  z  axis.  The  Hamiltonian  describing  its  motion  has  die  form 

H  =  c^a^  +  cpja^  +  p3/;/0c2,  (21.32) 


The  component  of  the  intrinsic  angular  momentum  perpendicular  to  the  direction  of  this 
motion 

S,=  4-fl'«  (21.33) 


More  precisely,  the  quantity  ^^ny^v  *s  a  second  rank  tensor 
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does  not  commute  with  this  Hamiltonian.  The  component  of  the  total  angular  momentum 
along  the  z  axis  does  commute  with  the  Hamilton! an 


n<3*> 


(21.34) 


and  can  therefore  be  determined  exactly  together  with  the  energy. 

Let  us  evaluate  the  error  in  the  determination  of  the  orbital  angular  momentum  by 
means  of  the  uncertainty  relation.  If  the  origin  of  the  coordinate  system  Is  taken  to  be 
at  the  center  of  the  wave  packet,  the  error  in  the  orbital  angular  momentum  will  be 


,  A   A/?v  ~  ~  ,  Since  the 

A**  *y     A.V* 


In  accordance  with  the  uncertainty  relation,  we  have 

errors  may  be  either  positive  or  negative,  we  find 

|   .   | Ax | 

I     I     I  A  ti  I 


(21.35) 

The  error  bLz  will  be  minimum  when  |  AA*  |  =  |  Ay  |.  Thus  the  error  bLz  due  to  the 
translational  motion  of  the  particle  is  of  the  order  of  the  spin,  and  therefore  the  per- 
pendicular components  of  the  intrinsic  angular  momentum  and  the  Dirac  magnetic 
moment  cannot  be  determined  simultaneously. 
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Fig.  21  3      Experiments     for    determining    the 
moment  of  the  neutron 


magnetic 


We  recall  that  the  Stern-Gerlach  experiment  allowed  for  the  determination  of  a  mag- 
netic moment  of  a  bound  electron;  since,  however,  the  orbital  angular  momentum  in  the 
s  state  is  zero,  it  was  the  spin  (or  Dirac)  magnetic  moment  that  was  actually  measured. 

In  accordance  with  (21.31),  the  interaction  energy  associated  with  the  perpendicular 
component  of  the  anomalous  magnetic  moment  is 

I/  .  „  „       SW  /O  1    7A\ 

v m  '==='  I^PJ^S  <2sl  •  {£1*30) 

This  component  commutes  with  the  Hamiltonian  (21.32)  and  can  therefore  be  measured 
exactly.  Consequently,  it  is  possible  to  measure  the  magnetic  moment  of  a  free  neutron 
when  the  magnetic  field  is  perpendicular  to  its  motion,  as  was  done  by  Bloch  and  Alvarez 
(1940).b  In  their  experiments  when  a  beam  of  neutrons  was  passed  through  a  piece  of 
magnetized  iron,7  the  most  pronounced  scattering  was  observed  for  those  neutrons  whose 
magnetic  moment  was  parallel  to  the  magnetic  induction  vector  inside  the  iron.  The 
emerging  beam  consisted,  therefore,  mostly  of  neutrons  whose  magnetic  moment  was 
antlparallel  to  the  magnetic  Induction  vector.  If  the  neutron  beam  passes  now  through 
two  magnetized  iron  plates  in  succession,  the  experiment  is  completely  analogous  to 
the  transmission  of  light  through  two  Nicol  prisms;  that  is,  the  first  iron  plate  acts  as 


Similarly,  it  may  be  shown  that  only  the  longitudinal  component  of  spin  commutes 
with  the  Hamiltonian  In  principle,  therefore,  it  also  can  be  measured  experimentally 

Neutrons  have  no  electric  charge  and  they  pass  quite  freely  through  matter  Actually, 
their  only  interactions  occur  on  collisions  with  the  nuclei. 
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a  polarizer  and  the  second  as  an  analyzer.  This  phenomenon  was  used  to  determine  the 
magnetic  moment  of  the  neutron. 

A  schematic  diagram  of  the  Bloch-  Alvarez  experiments  is  given  in  Fig.  21.3. 

Unpolarized  neutrons  moving  along  the  direction  of  the  x  axis  pass  through  the 
polarizer  (first  iron  plate,  with  a  magnetic  induction  vector  directed  upwards).  The 
emerging  beam  consists  mostly  of  neutrons  whose  magnetic  moment  is  directed  down- 
wards. These  polarized  neutrons  will  pass  freely  through  the  analyzer  if  its  magnetic 
induction  vector  is  directed  upwards,  like  that  of  the  polarizer.  On  the  contraryf  they  will 
be  transmitted  much  more  weakly  if  the  iron  plates  are  oppositely  magnetized. 

Between  the  polarizer  and  analyzer  there  is  a  device  that  reorients  the  neutron 
spin.  This  device  is  similar  in  principle  to  the  one  used  by  Rabl  in  his  nuclear  magnetic 
resonance  experiments  to  determine  the  magnetic  moment  of  the  proton  and  of  heavier 
nuclei.  The  basic  principle  of  the  instrument  is  as  follows.  In  the  space  between  the 
polarizer  and  the  analyzer,  a  relatively  strong,  constant  magnetic  field  is  applied 
parallel  to  their  magnetization  vectors  (see  Fig.  21.3).  A  neutron  whose  magnetic 
moment  is  antiparallel  to  this  magnetic  field  acquires  the  additional  energy 


!*„<#".  (21.37) 

If,  however,  the  magnetic  moment  is  parallel  to  the  field,  the  neutron  loses  an  energy 

A£  Par   rrr-jj^ST.  (21.38) 

In  addition,  there  is  a  relatively  weak  oscillatory  magnetic  field  //  applied  perpendicu- 
larly to  the  field  £% 

Hy  =  A  cos  «0f. 

This  oscillatory  field  will  reorient  the  spin  of  the  neutron  particularly  strongly  when  the 
frequency  w0  is  close  to  the  resonance  frequency 


(2U9) 


At  this  resonance  frequency,  the  number  of  neutrons  undergoing  a  reorientation  of  the 
magnetic  moment  reaches  its  maximum  value.  To  find  when  this  happens,  it  is  necessary 
to  determine  when  the  number  of  neutrons  passing  through  the  analyzer  is  minimum  in 
the  case  of  parallel  magnetic  induction  vectors,  or  maximum  in  the  case  of  antiparallel 
magnetic  vectors. 

Once  the  frequency  <o0  has  been  determined  (it  is,  in  effect,  twice  as  great  as  the 
Larmor  frequency  of  precession),  the  magnetic  moment  of  the  neutron  can  be  found, 
According  to  recent  data  it  is  equal  to  8 


where  the  unit  for  measuring  magnetic  moments  is  the  nuclear  magneton 


where  mp  is  the  mass  of  a  proton  and  JAO  is  the  Bohr  magneton. 

From  the  resonance  frequency  of  the  oscillatory  field,  we  can  determine  the  magnitude 
of  the  magnetic  moment  but  not  its  sign.  If,  however,  we  replace  the  oscillatory  magnetic 
field  by  a  rotating  magnetic  field,  we  can  also  determine  the  sign  of  the  moment,  since 
for  resonance  it  is  necessary  that  the  vector  equation  o>0  =  —  2o  should  hold,  where 
<jw» 

is  the  Larmor  frequency  of  precession  of  the  neutron  spin.    The  minus  sign 

shows  that  the  magnetic  moment  of  the  neutron,  just  as  for  the  electron,  is  directed 
opposite  to  the  spin. 


8Subacquent   improvements   on  the    Bloch   and   Alvarez    experiments   are    described   in 
E.  Segre,  Experimental  Nuclear  Physics,  Vol.  I,  1953. 
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The  resonance  frequencies  at  which  the  magnetic  moment  of  a  neutron,  proton,  or 
other  nucleus  is  reoriented  in  a  field  Q%" '~  10*  oersted  are  in  the  microwave  region. 
From  (21.39)  these  resonance  frequencies  will  be  of  the  order  of  102  Me  (this  corre- 
sponds to  a  wavelength  of  several  meters). 

The  magnetic  moment  of  the  proton  can  be  found  only  in  a  bound  state  since  the  proton 
is  a  charged  particle  and  therefore  has  a  Dlrac  magnetic  moment  in  addition  to  the 
anomalous  magnetic  moment.  The  experiments  cannot  be  carried  out  with  individual 
atoms,  but  only  with  molecules  in  which  the  electron  spins  cancel  out.  For  instance, 
the  magnetic  moment  of  the  proton  can  be  found  by  applying  the  method  of  nuclear  mag- 
netic resonance  to  the  H^  molecule.  To  obtain  molecules  with  polarized  protons,  we  do 
not  pass  them  through  magnetized  iron  (H2  molecules  do  not  pass  through  iron),  but 
instead  split  the  beam  by  passing  it  through  an  inhomogeneous  magnetic  field  in  the 
same  way  as  in  the  experiments  of  Stern  and  Gerlach.  This  method  of  measuring  the 
magnetic  moment  of  a  proton  was  first  used  by  Rabi  (1934),  even  before  the  experiments 
of  Bloch  and  Alvarez. 

The  magnetic  moment  of  the  proton  has  a  positive  sign;  that  is,  it  is  parallel  to  the 
spin.  It  is  equal  to  \*.p  =  -f-  2.7928  (in  units  of  a  nuclear  magneton).  It  contains  two 
components:  one  (fxp  =  I)9  corresponds  to  the  Dirac  magnetic  moment  and  the  other 
(j,anom  =  1.7928)  to  the  anomalous  magnetic  moment.  The  anomalous  magnetic  mo- 
ments of  the  proton  and  neutron  are  approximately  equal  and  have  opposite  signs.  The 
magnetic  moments  of  the  deuteron  (f*]-)=  +  0.8573)  and  other  nuclei  have  been  deter- 
mined by  similar  methods. 

One  of  the  main  results  of  the  experimental  determination  of  the  magnetic  moments 
of  nuclei  was  the  fact  that  they  are  of  the  order  of  the  nuclear  magneton,  whereas  the 
magnetic  moments  of  atoms  are  of  the  order  of  the  Bohr  magneton,  which  is  (mp //wa) 
times  larger  than  the  nuclear  magneton.  This  difference  served  initially  as  a  strong 
argument  in  favor  of  the  proton-neutron  model  of  the  atomic  nucleus,  according  to  which 
electrons  (which  have  a  large  magnetic  moment)  are  not  present  in  nuclei. 

F.  HYPERFINE  STRUCTURE  OF  THE 
HYDROGEN  SPECTRUM 

The  hyperfine  structure  of  spectral  lines  is  due  to  the  interaction  between  the  mag- 
netic moment  of  the  nucleus  and  the  magnetic  moment  of  the  electron.  If  the  nucleus  of 
the  hydrogen  atom  (Z  =  1)  has  a  magnetic  moment  np  =  ppVp  where  ap  are  the 
Paul!  matrices  acting  on  the  wave  function  of  the  nucleus,  it  gives  rise  to  a  magnetic 
field 

4=Vx  5VX  tf=Vx  A. 

This  magnetic  field  acts  on  the  magnetic  moment  of  the  electron  ji  =  —  fA0or  (a1  acts  on 
the  wave  function  of  the  electron),  which  results  in  an  additional  interaction  between 
the  nucleus  and  the  electrons  and  leads  to  the  hyperfine  structure  of  the  energy  levels 


(21.40) 


In  the  first  approximation  we  may  assume  that  there  are  no  preferred  directions  and, 
hence,  making  use  of  the  equations 

(«W)(vV)==^-(a'-ap)V3  (21.41) 

and 


This  should  properly  read:  At?ir  —   1 
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we  obtain 

<21-42> 

Consequently,  In  the  first  approximation,  the  interaction  of  magnetic  moments—  just  like 
the  contact  interaction—  influences  only  the  s  state.  The  expression  (o'-a^)  in  (21.42) 
can  be  found  from  the  following  simple  considerations. 

The  spin  matrices  of  the  proton  v'p   and  the  electron  a1  must  satisfy  the  relation 

I  W  (a'  +  a'p)*  =  ft'S  (S  +  1),  (21.43) 

where  S  is  the  absolute  value  of  the  total  spin,  which  is  equal  to  either  zero  (antiparal- 
lell  spins)  or  to  unity  (parallel  spins).  Then 


Using  the  fact  that    a12  +  o^2  =  6,  we  obtain 

(a'.ap  =  2S(S+l)-3.  (21.44) 

Since  integration  when  the  B  function  is  present  gives 

)==  |  <MO)  I  », 


we    obtain    the    following   expression  for  the   shift  of  s  levels  (hyperfine  structure)  of 
hydrogen: 

A£4,  =  |-  mp,  -L,  [2S  (S  +  1)  -  3]  ,  (21.45) 


fi* 

where  a0  —  •  -  ^  is  the  radius  of  the  first  Bohr  orbit,  and  the  value  of  |  <p  (0)  |  8  is  taken 

MO^O 

from  Eq.  (20.  14  X 

Two  cases  should  be  distinguished: 

(1)  Spins  of  the  proton  and  the  electron  are  antiparallel  (5  =  0);  then 

^-^-Wp-^sr  •  (21*46> 

(2)  Spins  of  the  proton  and  the  electron  are  parallel  (S  =  1);  then 

,-.  (2U7) 


The   difference   between   these  levels   represents   splitting   of  the  s  term  due  to  the 
interaction  between  the  electron  and  the  magnetic  moment  of  the  nucleus 


(21.48) 


If  we  use  (21,48)  to  calculate  the  s-term  splitting  for  the  case  n  =  1,  then  substituting 
the  value  of  ILP  obtained  In  Rabi's  experiments  and  setting  fi0  equal  to  the  Bohr  mag- 
neton, we  find 

A(otheor=1417   Me. 

On  the  other  hand,  a  careful  experimental  verification  of  the  splitting  of  this  level  by 
microwave  spectroscopic  methods  has  given  the  value 

Awe*P  ==  1420  Me. 
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The  relativistlc  corrections  and  the  corrections  for  the  finiteness  of  the  nuclear  mass 


do  not  raise  the  frequency  Ato*160*"  to  the  required  value  Ao>exP.  The  proton's 
magnetic  moment  has  also  been  measured  very  accurately.  To  explain  this  anomaly, 
therefore,  it  remained  to  assume  that  the  magnetic  moment  of  the  electron  is  not  exactly 
equal  to  the  Bohr  magneton,  but  is  instead  somewhat  larger,  Kusch  and  Foley  showed  that 
to  obtain  agreement  with  experiment  the  magnetic  moment  of  the  electron  must  be  taken 
to  be 


(2U9) 
where,  according  to  recent  data, 

6  =  0,00116. 

These  considerations  show  that  an  electron  will  have  a  very  small  anomalous  mag- 
netic moment  /JL  anom  =  fi0&  in  addition  to  the  Dirac  or  kinematic  magnetic  moment 

(  -   fxu).    We   shall  discuss   the   nature   of  the  anomalous  magnetic  moments  in  the  next 
chapter, 

Concluding  this  section,  it  is  necessary  to  point  out  that  the  hyperfine  structure  can- 
nor  explain  the  Lamb  shift  of  the  2si/a  level  (1.058  Me  relative  to  the  2/7  1/^  level).  First 
of  all,  it  follows  from  (21.48)  that  the  splitting  of  the  2si/0  term  is  of  the  order  of  200 
Me,  and,  in  addition,  the  center  of  mass  of  the  s  terms  is  not  shifted.  Suppose,  the 
level  with  S  =  1  (parallel  spins  of  the  electron  and  the  proton)  is  raised  by  a  certain 
amount  [see  (21.47)];  then  the  level  with  S  —  0  (antiparallel  spins  of  the  electron  and 
the  proton)  is  lowered  by  an  amount  three  times  as  large  [see  (21.46)].  Since  a  state 
with  S—  1  is  three  times  more  probable  than  a  state  with  S  —  0  (when  S  ==  1  the 
spin  may  be  directed  along  the  z  axis,  opposite  to  the  z  axis  and  perpendicular  to  the 
z  axis),  the  center  of  mass  of  the  s  states  remains  unaltered,  so  that  it  occurs  in  the 
same  position  as  when  the  magnetic  moment  of  the  nucleus  is  neglected.  The  hyper- 
fine  structure  cannot  therefore  account  for  the  Lamb  shift.  The  theory  of  this  effect 
will  be  considered  in  the  next  chapter. 


roblem  21.Q)  Find  the  shift  of  the  Is  levels  in  the  light  p  -me  sic  atoms  as  a  result 
of  iliS  .....  in&QElHJir  of  the  nuclear  structure,  taking  into  account  the  nuclear  motion  and 
also  the  variation  of  the  wave  function  within  a  distance  comparable  to  the  size  of  the 
nucleus. 

Hint.  The  shift  of  energy  levels  should  be  calculated  by  the  perturbation  method.  We 
use  Eqs.  (21.5)  and  (21.7)  where  the  wave  functions  are  those  of  hydrogen-like  atoms  with 
the  meson  mass  substituted  for  the  electron  mass.  Because  of  the  finiteness  of  the 
nuclear  mass,  the  motion  of  the  nucleus  has  much  greater  influence  on  the  position  of 
the  energy  levels  of  a  meslc  atom  than  on  the  levels  of  an  ordinary  atom.  It  is  there- 
fore necessary  to  use  the  reduced  meson  mass  in  more  accurate  calculations.  The 
wave  functions  of  a  meson  change  appreciably  inside  the  nucleus;  therefore,  the  quantity 
|  •]/  |a  can  be  replaced  by  its  value  at  the  origin  only  in  a  comparatively  crude  qualita- 
tive estimate,  as  in  the  derivation  of  (21,14). 

Answer.   In  the  general  case 

•j  '>  r    o 


{•j  '>  r    o  o  *i  ~i  \ 

1  ~Tb  +  '2¥~[tP  +  &  +  y>\  e     \  ' 


where 


M 
For  small  b  and  Al  -*>  co,  this  expression  may  be  obtained  from  Eq. 


Chapter  22 

The  Electron-Positron  Vacuum  and  the 
Electromagnetic  Vacuum 


iIRAC  THEORY  OF  "HOLES."  DISCOVERY  OF 
THE  POSITRON 

The  Dirac  theory,  which  includes  spin  effects  and  relativistic  effects,  was  able  to 
account  for  the  fine  structure  of  the  spectral  lines  of  hydrogen-like  atoms  and  the 
anomalous  and  normal  Zeeman  effects.  The  Dirac  theory,  however,  also  gave  rise  to 
a  number  of  major  difficulties  in  connection  with  the  interpretation  of  negative  energy 
states.  These  difficulties  were  not  overcome  for  some  time,  but  they  eventually  led  to 
fundamental  new  discoveries  In  relativistic  quantum  mechanics. 

In  our  treatment  of  the  motion  of  a  free  particle  (Chapter  17),  we  mentioned  that  the 
Dirac  equation  allowed  solutions  corresponding  to  both  positive  and  negative  values 
of  energy.  It  is  worth  noting  in  this  connection  that  solutions  with  negative  energy  are 
not  characteristic  of  the  Dirac  theory  alone,  but  appear  in  any  relativistic  theory.  In 
relativistic  mechanics,  the  energy  of  a  free  particle  is  connected  with  the  momentum 
and  rest  mass  by  the  well-known  expression 


which  has  two  roots 


The    regions  of  positive  and  negative  energies  are  separated  by  an  interval  equal  to 

2m0c2     (see  Fig.   22,1).    At  first  glance,  states  with  negative  energy  do  not  appear 

to  have  a  real  physical  meaning,  since  the  region  of  negative  energies  extends  to  infinity 

(£  =  —  co)   and,  therefore,  there  is  no  lowest 

state.   This  would  imply  that  no  ordinary  state 

is  stable,  since  a  spontaneous  transition  to  a 

lower  energy  state  would  always  be  possible,       — 

Furthermore,   a  particle  with  negative  mass       — 

(negative   energy)   would   have    a   number    of       • — 

strange  properties:  for  example,  it  would  repel 

a  particle  with  positive  mass.  — 

In  classical  physics,  states  with  negative 

energy  do  not  cause  any  difficulties,  because        

the  energy  of  a  moving  particle  can  only  change       ~ 

continuously;    therefore,  transitions  from  the       — 

states  with  positive  energy  to  the  states  with 

negative  energy  cannot  take  place,  since  the 

energy  would  have  to  change  dlscontinuously  by       Fig.  22.1.  Allowed  energy  levels  of  a 

the    amount  A£^2//ioC2,    Defining  the  energy  free  Dirac  particle 

to  be  positive  from  the  start,  we  may,  therefore, 

neglect  the  negative  energy  states. 

The  situation  is  quite  different  in  quantum  theory,  where  transitions  can  take  place 
between  the  states  in  a  discrete  spectrum,  as  well  as  in  a  continuous  spectrum.  States 
with  negative  energy  cannot  be  excluded  simply  by  defining  the  energy  to  be  positive 
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at    the    initial  time,  because   the  probability  of  the  transition  between  the  states  with 
energy  -f  m0c*  and  —  W0e2  is  not  equal  to  zero. 

In  order  to  avoid  transitions  of  electrons  to  negative  states,  Dirac  suggested  (1931) 
that    we    regard    all   negative  energy  Jevels   as   occupied  by  electrons  (see  Fig.  22.2), 

so  that  an  electron  with  a  positive  energy  cannot 

Jump  into  a  negative  energy  level  under  ordinary 

. '    "    ".        "  _.      '  ._"  conditions.1     The    state    in  which   all   negative 

-  •  •     •  "•  •  _  _  •   •  energy  levels  and  no  positive  energy  levels  are 

mQc*          occupied  is  called  the  "electron  vacuum," 

Let  us  now  assume  that  a  7-ray  photon  with 

2m0c2  energy  *>  2/HoC2  excites  an  electron  from   the 

2         electron  vacuum  into  a  positive  energy  state.  In 
-      -  •-  »  . "_  ~  Q  this   case,  the  absorbed  'Y-ray  photon  will  be 

— • replaced  by  an  electron  with  positive  energy,  and 

;     ;  J  —     .;  *'        '     -  a  "hole"  will  appear  In  one  of  the  negative  energy 

a_ states  (see  Fig.  22.3). 

The  decisive  factor  that  led  to  the  success  of 

Fig   22  2     Zero-point   energy    dia-  Dirac's   hypothesis   was  that  he  interpreted  the 

Kram       of       the       electron-positron  "hole"   a*   *  particle  (the  "positron")  with  posi- 

varuum  tive  mass  eclua^  to  t*ie  mass  of  the  electron,  but 

um  with  the  opposite  charge.2    Let  us  suppose  that 

there  are  no  particles  at  the  initial  time.  Then 

the  "zero-point  energy"  Evac  (the  energy  of  the  electron  vacuum)  is  equal  to  the  sum  of 
the  energies  of  the  electrons  in  the  negative  energy  states  /*_ 


The  "zero-point  charge"  is  equal  to 


Thus,  from  the  standpoint  of  the  hole  theory,  the  absence  of  any  real  particles  means 
that  all  positive  energy  states  are  empty,  and  all  negative  energy  states  are  occupied. 
This  case  corresponds  to  the  electron  vacuum  (see  Fig.  22.2). 

When  an  electron  Jumps  from  a  negative  energy  state  //_  to  a  positive  energy  state  w+ , 
the  total  energy  change  of  the  system  is 

I  (22.2) 


This  change  represents  the  sum  of  the  positive  energies  of  two  nascent  particles.3  Similar 
arguments  with  regard  to  charge  show  that  the  charge  of  the  nascent  particle  correspond- 
ing to  the  "hole"  is  opposite  to  the  electron  charge 

.  (22.3) 


In   accordance  with  the  Pauli  exclusion  principle  (see  Chapter  6),   only  one  electron 

can  occupy  each  state 
2 
A  similar  conclusion  was  reached  in  the  treatment  of  the  hole  theory  of  semiconductors 

(see  the  discussion  of  the  band  theory  of  conduction  in  Chapter  6) 

The  prime  attached  to  the  summation  sign  (^  )  means  that  the  summation  extends  over 
all  states  r»'_  except  the  state  n'_   -  H_ 
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Thus,  the  transition  of  an  electron  from  a  negative  energy  state  to  a  positive  energy 
state  (as  a  result  of  the  absorption  of  a  7-ray  quantum  with  energy  greater  than  2/«0f M) 
leads  to  the  creation  of  a  pair  of  particles.  The  unoccupied  negative  energy  state  ("hole") 
may  be  regarded  as  a  state  occupied  by  a  particle  with  positive  charge  +  e0  and  positive 
energy.4  This  particle,  which  was  predicted  by 
Dirac,  was  called  a  "positron"  and  was  dis- 
covered by  Anderson  in  cosmic  radiation  (1932). 

Once  it  is  interpreted  in  this  way,  the  Dirac 
theory  describes  in  a  natural  manner  both  the 
electron  and  the  positron.  The  positron  is  an 
antiparticle;  its  wave  function  satisfies  the  Dirac 
equation  with  positive  energy  and  positive  charge 
(see  Problem  18.5). 

In  the  Dirac  theory,  pair  annihilation  —  a 
process  which  is  the  reverse  of  pair  creation— is 
also  allowed.  This  process  takes  place  when  an 
electron  with  positive  energy  jumps  into  a  hole. 
In  this  case  the  electron  and  positron  are  con- 
verted into  7*-rays. 

In  these  transformations,  the  laws  of  conser- 
vation of  energy  and  momentum  are  rigorously 

obeyed.  As  already  mentioned,  pair  creation  due  to  the  absorption  of  a  7-ray  photon  can 
occur  only  in  the  presence  of  a  third  particle  (for  example,  a  nucleus),  that  takes  up  the 
excess  momentum  of  the  photon 


Fig.  22.3.  Formation  of  an  electron- 
positron  pair 


Ze0 


(22.4) 


Similarly,  the  conversion  of  an  electron-positron  pair  into  T-rays  takes  place  in  accord- 
ance with  the  laws  of  conservation  of  energy  and  momentum;  as  a  result  of  pair  annihila- 
tion, at  least  two  'Y-ray  photons  are  created 

*,+*__  =  27.  (22.5) 

In  order  to  show  this,  we  may  choose,  without  the  loss  of  generality,  a  coordinate  system 
in  which  the  electron  and  positron  move  with  opposite  momenta,  so  that  k _  =  —  fc+~k 
(the  center-of-mass  system).  Then,  according  to  the  law  of  conservation  of  momentum, 
the  total  momentum  of  the  two  photons  which  are  formed  as  a  result  of  annihilation  must 
also  equal  zero 

#  (ki  +  k»)  =-  0.  (22. 5a) 


Using  the  law  of  conservation  of  energy 


cH  (ki  4-  k2)  ~  2  J/OTiJic4  -f  c-'  W, 
we  find  that  &t  ==  —  £j,  and  the  energy  of  each  of  the  photons  is  equal  to 


(22.5b) 


e  s=  cfiki  =  cfc/Ja  =  fmlc*  +  c*1l*k*. 

The  lowest  value  of  the  photon  energy  is  obtained  when  k  =  0  (that  is,  when  the  electron 
and  positron  are  at  rest).  Then  t  mln  =  ;«0ca.  These  two  7-ray  photons  move  apart  with 
the  same  energy  and  oppositely  directed  momenta.  It  is  easy  to  see  that  the  electron- 
positron  pair  cannot  be  converted  into  a  single  7-ray  photon  (&«  =  0),  since  the  laws  of 
conservation  of  energy  and  momentum  cannot  be  satisfied  simultaneously  with  only  one 
y  -ray  photon. 


With  the  help  of  quantum  field  theory,  we  can  construct  a  theory  of  the  electron- 
positron  vacuum  that  is  symmetric  with  respect  to  charge.  However,  even  with  the  theory 
described  above,  which  is  asymmetric  with  respect  to  electrons  and  positrons  (an  electron 
is  a  particle,  whereas  a  positron  is  a  "hole")  we  can  give  a  clear  explanation  of  many 
phenomena  involving  the  transformation  of  particles 
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The  law  of  conservation  of  total  angular  momentum  (orbital  plus  spin)  is  also  very 
Important  in  the  annihilation  processes.  If  an  electron  moves  with  a  nonrelativistic 
velocity,  this  law  (as  shown  in  Chapter  18)  can  be  resolved  into  a  law  of  conservation  of 
orbital  angular  momentum  and  a  law  of  conservation  of  spin. 

The  law  of  conservation  of  spin  can  be  observed  particularly  clearly  in  the  annihila- 
tion of  positromum  (a  hydrogen  atom  in  which  the  proton  is  replaced  by  a  positron,  or 
more  exactly,  a  system  in  which  an  electron  and  a  positron  rotate  about  their  common 
center  of  massX  In  this  atom,  the  nucleus  (that  is,  the  positron)  and  the  electron  have  the 

same  mass,  and  therefore  the  reduced  mass  is  -y-.  The  energy  of  levels  in  the  positron- 
ium  atom  will,  therefore,  be  one  half  as  large  as  In  the  hydrogen  atom 

RK 
Ln~~      2/i5"' 

while  the  radius  of  the  orbit  will  be  twice  as  large.  The  velocities  of  the  electron  and 
positron  may  be  regarded  as  nonrelativistic,  Just  as  in  the  hydrogen  atom.  If  the  spins 
of  the  electron  and  positron  are  antiparallel  (parapositronium),  positronium  can  decay 
into  two  %-ray  photons  (the  corresponding  mean  life  Is  I  25  -  10~10  sec). 

The  total  spin  of  parapositronium  is  equal  to  zero,  and  therefore  the  two  photons  move 
apart  with  opposite  directions  of  spin  (that  is,  their  total  spin  is  zeroX 

If,  however,  the  spins  of  the  electron  and  positron  are  parallel  (orthopositronium)  the 
system  must  decay  into  three  7-ray  photons  (the  corresponding  mean  life  is  1.4  •  10~7sec). 
The  spin  of  orthopositronium  is  equal  to  unity.  Orthopositronium  cannot  decay  into  one 
>-ray  photon  with  spin  1,  because  in  this  case  the  law  of  conservation  of  momentum 
would  be  violated.  It  cannot  decay  into  two  7 -ray  photons  because  then  the  law  of  con- 
servation of  spin  would  be  violated  (the  total  spin  of  two  7-ray  photons  Is  either  two  or 
zero).  Only  if  orthopositronium  decays  into  three  7-ray  photons  will  the  law  of  con- 
servation of  momentum  and  the  law  of  conservation  of  spin  be  satisfied. 

The  discovery  of  the  positron  opened  a  new  stage  in  the  study  of  elementary  particles. 
This  discovery  showed  that  particle  shad  a  new  fundamental  property— interconvertibility— 
and  confirmed  the  existence  of  antlparticles.  We  can  regard  the  creation  of  a  positron 
as  the  conversion  of  a  7-ray  photon  into  an  electron-positron  pair,  and  the  annihilation 
of  an  electron-positron  pair  as  the  conversion  of  an  electron-positron  pair  into  7-ray 
photons. 


B.  THE  LAMB  SHIFT  OF  ENERGY  LEVELS  OF 
ATOMIC  ELECTRONS 

When  an  electron  moves  in  an  atom,  it  interacts  with  the  electromagnetic  vacuum, 
as  well  as  with  the  atomic  nucleus  and  the  electron-positron  vacuum.  The  interaction 
between  the  electron  and  the  electromagnetic  vacuum  exerts  a  particularly  strong 
influence  on  the  motion  of  the  electron  in  the  atom,  and  it  explains  the  shift  of  the  2s1/a 
level  upwards  relative  to  the  2/;1/4>  level  (in  the  hydrogen  atom). 

A  complete  theory  of  this  phenomenon  can  be  constructed  only  by  means  of  quantum 
electrodynamics,  which  is  based  on  the  theory  of  second  quantization.  But  even  without 
referring  to  this  theory  we  can  still  obtain  the  appropriate  equations  with  accuracy  up  to 
coefficients  of  the  order  of  unity,  while  using  comparatively  simple  physical  arguments. 

One  of  the  basic  ideas  of  the  quantum  field  theory  is  that  each  wave  or  field  can  be 
associated  with  a  particle,  'thus,  for  example, the  Dlrac  ty  waves  correspond  to  electrons 
and  positrons,  and  light  waves  correspond  to  photons.  It  is  very  well  known,  however, 
that  Maxwell's  equations  describe  not  only  light  waves,  but  also  electrostatic  and  magnetic 
fields,  which  depend  on  charges  and  their  velocities  (an  accelerating  charge  produces 
electromagnetic  or  light  wavesX  The  electrostatic  and  magnetic  fields  can  be  associated 
with  "pseudo-photons,"  which  have  observable  effects  only  in  the  presence  of  charges. 
An  electrostatic  field  can  be  expanded  in  a  Fourier  series,  that  is,  schematically  repre- 
sented as  a  set  of  oscillators  with  different  frequencies.  An  analogous  expansion  holds 
for  the  vacuum  field  of  "pseudo-photons" 

£P.P. =  2  E  (0>)  C°S  w'' 
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Since  the  rest  mass  of  a  "pseudo-photon"  is  zero,  we  may  write  the  relation  between  its 
frequency  w   and  the  wave  number  k  as  follows: 


/**+*;  +*t  (22-7) 

The  components  kx,  kv,  kz  are  related  to  the  Integers  nh  n3,  nt  and  the  period  L  (see 
Chapter  6)  by  the  expressions 


.  !         , 

kx  =  -j—  ,  and  so  forth, 

where  «i  =  it  1,  ±2,  ±3  .....  Hence 


so  that 


or 

<o2dw        __  8rc3 

-?r-*2—  TT-. 
If  the  system  is  spherically  symmetric,  this  relation  may  be  written  as 


(22.8) 


With  the  aid  of  Eq.  (22.6),  we  find  that  the  energy  of  the  electrostatic  field  inside  a  region 
of  volume  L*  is 


=  i  J (£)2  *'*  =  8«  2 (£  (to))2-  (22-9) 


Lp.p. 


In  deriving  this  expression,  we  have  used  the  relation 

( cos  tot  cos  "'Oav  —  TT  &<w 
z 

Just  as  in  the  theory  of  the  harmonic  oscillator,  the  energy  of  the  field  in  the  lowest 
energy  state  is  not  equal  to  zero,  but  instead  is  equal  to  the  sum  of  the  zero-point 
energies  of  the  harmonic  oscillator  (see  Chapter  10) 


The  coefficient  2  in  front  of  the  summation  sign  takes  into  account  the  fact  that  each 
harmonic  corresponds  to  two  different  states  of  polarization  of  the  "pseudo-photons." 
In  the  state  where  there  are  no  real  photons  the  total  energy  of  an  electromagnetic 
field  (the  vacuum),  must  be  equal  to  this  zero-point  energy  (e  p^  =  «o).  Hence,  taking 
(22.8)  into  account,  we  find 

(E(»W=*H»=**.-°y±.  (22.K)a) 

In  the  absence  of  real  particles  and  external  fields,  the  vacuum  (including  the  electro- 
magnetic one  containing  photons  and  pseudo-photons)  does  not,  as  a  rule,  have  any 
observable  effects  because  it  is  isotropic.  On  the  other  hand,  when  real  particles 
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and  external  fields  are  present,  the  Isotropy  is  disrupted  and  virtual  particles  ("pseudo- 
photons"  or  electron-positron  pairs)  are  created  and  subsequently  annihilated  (vacuum 
fluctuations). 

A  simple  physical  picture  of  a  few  basic  notions  involved  in  the  theory  of  the  vacuum 
was  obtained  somewhat  unrlgorously  by  Welton  by  means  of  a  semiclassical,  nonrelativistic 
treatment  of  the  motion  of  an  electron,  taking  into  account  its  interaction  with  the  vacuum 
fluctuations.  As  an  illustration  of  Welton's  calculations,  let  us  attempt  to  give  a  more 
concrete  meaning  to  the  zero-point  fluctuations  (22. 10JL  In  a  rough  approximation  we 
can  use  the  ordinary  classical  equation 

mor^Ep.p.  (22.11) 

to  find  how  an  electron  will  be  affected  by  the  vacuum  field  of  "pseudo-photons."  Ex- 
panding £p  -  in  a  Fourier  series 

£p.p.  =  2£(<l>)coso>'  (22-12) 

U) 

and  integrating  Eq.  (22.11),  we  find  the  displacement  of  the  electron  positron  due  to  the 
vacuum  field: 

«^-  (22-13> 

The  mean-square  deviation  of  the  position  is  given  by 

(SR)i  =  (7F-  (r)'  =  -  — ->! 

since 


2 
and  therefore  r  =  0    Substituting  (22.10a),  we  obtain  a  divergent  integral 


==l  ,«/_»  _}Y'<"(  (22.15) 

n  tic  \  m0c  I    J    u>  »  \    *     / 


from  which  a  finite  (observable)  part  can  be  separated  out  if  the  range  of  variation  of  the 
frequency  is  cut  off  from  above  at  a  frequency  corresponding  to  the  rest  energy  of  the 
electron 


and  from  below  at  a  frequency  corresponding  to  the  minimum  energy  of  the  electron  in 
an  atom5 

]/:j       mn*4 
"*in=V  =  aB?'  <22-17> 

Substitution  of  (22.17)  and  (22.16)  into  (22.15)  gives 

n^.  (22.18) 

2 


The  limits  of  the  range  of  variation  of  <y  are  specified  more  accurately  in  renormali- 
zation  theory 
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Thus  it  can  be  seen  that  the  vacuum  field  of  "pseudo-photons"  will  cause  the  electron 
to  perform  a  motion  somewhat  resembling  Brownian  motion  with  a  definite  value  of  the 
mean  square  displacement.  It  is  well  known  that  the  Brownian  motion  of  a  particle  is  due 
to  its  collisions  with  randomly  moving  molecules  of  the  surrounding  medium.  In  similar 
fashion,  an  electron  undergoes  "collisions"  (of  a  special  kind)  with  the  assembly  of 
virtual  particles  forming  the  vacuum. 

As  was  shown  by  N.  Bogolyubov  and  S.  Tyablikov,  the  vacuum  fluctuations  cause  a 
certain  "smearing  out"  of  a  point  electron.  As  a  result,  the  electronic  radius  turns  out 
to  be  the  geometric  mean  of  the  classical  radius  and  the  Compton  wavelength 


TT^  y~JLm  (22.19) 

w0ca  m0c       '      m0c 

The  existence  of  this  effective  radius  should  have  several  consequences.  In  particular, 
the  interaction  between  the  electron  and  the  nuclear  charge  should  be  changed,  and  this, 
in  turn,  should  lead  to  an  additional  coupling  energy  and  thus  to  a  shift  of  the  energy 
levels.  The  usual  expression  for  the  potential  energy  of  an  electron  in  the  field  of  a 
nucleus 


(22.20) 

is  replaced  by 

V  =  -  efl(r+  Ar)  =  ~  e0  [1  +  (Ary)  +  \  (^v)2  +...]*  (r).          (22.21) 
Changing  to  the  average  values  and  using  the  relations 

V,  (22.22) 


we  obtain 


+  ...  }*(r).  (22.23) 

The  additional  energy  of  interaction  between  the  electron  and  the  nucleus  is  given  by 

n    '<'>•      <22-24> 


since  the  Coulomb  potential  of  the  hydrogen  nucleus  satisfies  the  Pols  son  equation 

(22.25) 


To  obtain  an  expression  for  the  shift  of  energy  levels  in  the  hydrogen  atom,  it  is  necessary 
to  average  the  additional  interaction  energy  over  the  corresponding  state 


Wp.  p.=     (V  -  V)  I  «•  (r)  I*  d*x  =      c*a  \  *  (0)  |«  In       .  (22.26) 

ITiis  shift  occurs  only  for  s  states,  since  the  quantity  1  1  (0)  |ain  the  approximation  under 
consideration  vanishes  for  other  states  (/=  1.  2,  3t  ...),  whereas  for  the  s  state 

,  (22.27) 


where  aQ  =  ft2  /MO??  is  the  radius  of  the  first  Bohr  orbit  of  the  hydrogen  atom,  and  n  is  the 
principal  quantum  number.  If  we  now  substitute  this  value  into  (22.26),  we  obtain  a 
formula  for  the  s  -level  shift 

A£p.p  =  3^'f  in?.  (22.28) 

This  formula  was  first  derived  by  Bethe. 
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A  substitution  of  numerical  values  for  the  2s  state  (n  =  2)  gives 

A£p-  p^  =  1 7,8/?  =  1040  Me  .  (22.29) 

This  is  in  fairly  good  agreement  with  recent  experimental  data  (A£  =  1057.77  Me)  for 
the  Lamb  shift  (see  Chapter  20 X 

A  complete  study  of  the  shift  of  energy  levels  of  atomic  electrons  with  the  help  of 
relatlvistic  quantum  field  theory  gives  considerably  better  agreement  between  theoretical 
and  experimental  results  than  the  semiclasslcal  formula  (22.28),  The  discrepancy  is 
reduced  to  less  than  1  Me. 

For  the  sake  of  brevity,  we  shall  not  discuss  the  modern  theory  of  the  vacuum  in 
greater  detail,  and  we  shall  confine  ourselves  to  a  mere  enumeration  of  the  main  results 
that  this  theory  has  yielded. 


C.    ELECTRON-POSITRON  VACUUM 

Equation  (22.28)  for  the  Lamb  shift  was  obtained  from  a  calculation  of  the  interaction 
between  electrons  and  the  electromagnetic  vacuum.  In  addition  to  the  electromagnetic 
vacuum,  there  exists  an  electron-positron  vacuum  and  also  vacuum  states  of  other 
particles.  The  method  of  second  quantization  (which  is  in  some  measure  applicable  to 
all  fields)  can  be  used  to  calculate  the  influence  of  the  electron- positron  vacuum. 

In  modern  quantum  field  theory,  the  study  of  the  properties  of  the  vacuum  states  of 
different  particles  plays  a  particularly  important  role.  The  vacuum  gives  rise  to  inter- 
actions between  particles.  In  particular,  the  electromagnetic  interaction  of  two  electrons 
(Coulomb's  law)  may  be  regarded  as  an  interaction  which  takes  place  through  the  electro- 
magnetic vacuum,  with  one  electron  emitting  a  virtual  photon  and  the  other  electron 
absorbing  it. 

On  the  other  hand,  the  vacuum  represents  a  sort  of  "reservoir,"  from  which  real 
particles  are  "drawn"  when  they  are  created,  and  to  which  they  "return"  when  they  are 
annihilated.  We  have  already  come  across  the  electron-positron  vacuum  as  a  "collection" 
of  electrons  in  negative  energy  states.  Unfortunately,  it  has  no  classical  analog  and 
therefore  we  cannot  use  a  semiclassical  analysis,  as  was  possible  in  the  case  of  the 
electromagnetic  vacuum.  The  Coulomb  field  of  a  nucleus  can  polarize  the  electron- 
positron  vacuum  (so  that  an  electron  will  behave  as  if  it  were  in  a  dielectric),  giving  rise 
to  an  additional  interaction 

A  t     *      \  2 

r).  (22.30) 

Comparing  (22.30)  with  (22.24),  we  see  that  the  level  shift  due  to  interaction  with  the 
electron-positron  vacuum  is  about  1/40  times  as  large  as  the  level  shift  due  to  the  fluc- 
tuations of  the  electromagnetic  field,  and  is  opposite  in  sign. 

The  electron-positron  vacuum  exerts  a  particularly  strong  influence  on  the  magnetic 
properties  of  the  electron.  It  was  shown  by  Schwinger  that  the  magnetic  moment  of  the 
electron  becomes  somewhat  larger  than  the  Bohr  magnetron 

(22.31) 

The  change  in  the  magnetic  moment  of  the  electron,  with  the  secondp-order  term  taken 
into  account, 

AHe-_p==  —  (g;  —  0.328  ^  \  ji0 «»  —  0.001 1596  (&„  (22.32) 

is  in  good  agreement  with  experimental  data  obtained  with  the  aid  of  microwave  spectro- 
scoplc  methods  (see  Chapter  21X  Qualitatively,  the  appearance  of  the  additional  magnetic 
moment  of  the  electron  and  the  sign  of  the  moment  can  be  explained  as  follows.  The  initial 
electron  A  (see  Fig.  22.4),  whose  spin  is  directed  upwards  (we  specify  in  this  manner  a 
preferred  direction),  creates  a  virtual  pair— electron  A}  and  positron  5'— which  generally 
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have  opposite  spin  directions.  Since  we  have  a  preferred  direction  (determined  by  the 
spin  of  electron  A)  the  spins  of  electron  A'  and  positron  £'  can  be  directed  in  two  ways: 
either  the  spin  of  A  will  be  antiparallel  to  the  spin  of  A'  and  parallel  to  the  spin  of  B1 
(case  1  in  Fig.  22.4),  or  the  spin  of  A  will  be  parallel  to  the  spin  of  A  and  antiparallel 
to  the  spin  of  £'  (case  2  in  Fig.  22.4). 

In  determining  the  additional  magnetic  mo- 
ment, it  is  necessary  to  consider  the  following 
possibilities  of  pair  creation  and  annihilation: 

(a)  Pair  A B'  is  created  and  then  annihilated. 
This  is  possible  in  case  1  and  case  2.  No  cor- 
rection to  the  magnetic  moment  should  arise, 
because  the  probability  of  the  creation  process 
is   the   same  in  both  cases,  and  the  magnetic 
moments  have  opposite  signs  in  the  two  cases. 

(b)  Since   .4  and  B*  have  opposite  spins  in 
case  2,  positron  B[  may  be  annihilated  together 
with  A\  as  well  as  with  the  initial  electron  A. 
In  case  1,  however,  this  process  is  less  prob- 
able,   since    the   spins   of  pair    A  and  /?'  are 
parallel  to  one  another.   This  process  gives  a 
preferred  state  in  which  the  additional  magnetic 
moment  is  parallel  to  the  initial  magnetic  mo- 
ment of  electron   A,  since,  as  can  be  easily 
seen,  the  magnetic  moment  of  all  three  particles 
(A,  A'  and  Bl)  is  directed  downwards.   Conse- 
quently,  there   arises  an  additional  magnetic 
moment  of  the  electron,  equal  to 


I 


if. 


Antiparallel 
spins  of 
electrons 
A  and /I7 


2 

Parallel 
spins  of 
electrons 
A  and  A1 


Fig.  22.4  Virtual  creation  of  an  elec- 
tron-positron pair  by  an  electron  The 
spins  of  the  real  and  virtual  particles 
are  depicted  by  solid  and  dashed  ar- 
AH-e.-p.  t==  —  2fVf,  rows,  respectively 

where  f  is  a  numerical  factor  which  determines  the  probability  that  the  initial  electron 
A  will  be  annihilated  together  with  positron  B". 

These  simple  considerations  give  the  correct  sign  of  this  anomalous  magnetic  mo- 
ment. More  rigorous  calculations  led  to  Eq.  (22.32). 

In    addition    to  the   Dirac  magnetic  moment,  ji=j~--fi03,  an  electron  will  have  an 


anomalous   magnetic  moment  ji      =•  —  ,>-! 


whlch  arises  owing  to  the  interaction 


between  the  electron  and  the  electron-positron  vacuum. 

The  anomalous  magnetic  moment  of  nucleons  (neutrons  and  protons)  can  be  explained 
in  meson  theory.  The  accuracy  of  the  results,  however,  is  much  less  striking  than  in 
calculations  of  the  anomalous  magnetic  moment  of  the  electron.  This  is  due  to  the  fact 
that  meson  theory  is  still  in  a  much  less  satisfactory  state  than  quantum  electrodynamics. 

According  to  the  meson  theory,  protons  and  neutrons  interact  with  the  ir-meson  field. 
Since  the  proton  can  dissociate  into  a  neutron  and  a  positively  charged  TT  meson,  and  the 
neutron  into  a  proton  and  a  negatively  charged  IT  meson,  the  anomalous  magnetic  moments 
of  protons  and  neutrons  due  to  the  ir-meson  field  must  be  approximately  equal  in  magni- 
tude and  opposite  in  sign. 

The  appearance  of  the  additional  magnetic  moment  of  a  neutron  is  explained  as  follows. 
The  neutron  has  a  definite  probability  of  dissociating  into  a  proton  and  a  ir  meson.  Since 
the  spin  of  a  TT  meson  equals  0, it  does  not  possess  an  intrinsic  magnetic  moment.  There- 
fore, there  will  be  a  contribution  to  the  magnetic  moment  of  the  neutron  from  a  pion 
which  is,  for  example,  in  a  p  state  (we  recall  that  the  orbital  angular  momentum  In  the 

s  state  is  also  equal  to  0).  The  magnetic  moment  of  a  v  meson  is  ^  «=»  7  times  greater 

m« 
than  the  nuclear  magneton,  and  therefore  the  ir  meson  produces  the  main  contribution  to 

the  anomalous  magnetic  moment  of  the  neutron.  For  the  virtual  process  to  fulfill  the 
law  of  conservation  of  angular  momentum,  it  is  necessary  that  the  direction  of  the 
orbital  angular  momentum  (equal  to  1)  of  the  virtual  Trmeson  should  coincide  with  the 
direction  of  spin  of  the  neutron,  while  the  spin  of  the  virtual  proton  (equal  to  Va,  Just 
like  the  spin  of  the  neutron)  should  be  directed  opposite  to  the  spin  of  the  neutron  so  that 

Sfl  —  Sn  —  spt  s/t  =  sp=™,  3«  =  1-  Since  the  w  meson  has   a  negative  charge  and  its 

angular  momentum  is  parallel  to  the  spin  of  the  neutron,  it  gives  rise  to  the  negative 
magnetic  moment  of  the  neutron. 
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In  order  to  explain  the  magnitude  of  the  anomalous  magnetic  moment  of  the  neutron, 
one  must  assume  that  the  neutron  spends  !/«  of  its  time  in  its  dissociated  state.  This 
estimate  is  perfectly  reasonable  and  gives  the  correct  sign  and  order  of  magnitude  of  the 
magnetic  moment  of  the  neutron (|x£n  ~  —  2p.nuc).  Similarly,  it  can  be  shown  that  the  dis- 
sociation of  the  proton  into  a  neutron  and  a  n+  meson  gives  rise  to  a  positive  anomalous 
magnetic  moment  of  the  proton  (\>*n^  -}-  2unuc).  Moreover,  the  proton  has  a  Dirac  mag- 
netic moment  (M-jp  — l^nucX  More  accurate  data  on  the  magnitude  of  the  magnetic  moment 
of  the  proton  and  neutron  are  given  in  the  preceding  chapter. 

Thus  we  have  a  basis  for  regarding  the  anomalous  magnetic  moment  of  Dirac  particles 
as  a  secondary  effect  which  can  be  explained  on  the  basis  of  the  field  theory.  This 
moment  does  not  appear  in  the  initial  equations,  but  arises  as  a  result  of  the  interaction 
either  between  the  electric  charge  and  the  electron-positron  field  (in  the  case  of  electrons), 
or  between  the  nuclear  charge  and  the  7r-meson  field  (in  the  case  of  protons  and  neutrons). 


D.  RENORMALIZATION 

One  of  the  most  important  sections  of  modern  quantum  field  theory  is  concerned  with 
the  question  of  renormalization,  This  subject  is  not  yet  in  a  mathematically  satisfactory 
state,  but  a  number  of  important  results  have  been  obtained. 

The  basic  idea  involved  In  renormalization  is  the  separation  of  finite,  observable 
terms  from  the  divergent  terms  describing  the  interaction  between  an  electron  and  the 
electromagnetic  or  electron-positron  vacuum. 

In  effect,  the  question  of  renormalization  first  arose  in  classical  electrodynamics— 
for  example,  in  the  theory  of  the  electromagnetic  mass  of  the  electron.  If  we  assume 
that  the  electron  charge  is  distributed  inside  a  sphere  of  radius  r0,  its  classical  electro- 
magnetic mass  will  be  equal  to 

m  field  =7-^f  (22.33) 

r0C 

where  7  is  a  factor  of  the  order  of  unity  which  depends  on  the  charge  distribution  inside 
the  sphere.  The  attempts  to  construct  a  classical  electrodynamics  in  which  a  finite 
radius  r0~  10  n  cm  would  give  a  reasonable  value  for  wfield  (the  Lorentz  theory,  the 
nonlinear  Bom-lnfeld  theory,  the  Boppe-Podolsky  theory  with  higher  derivatives)  did  not 
give  any  satisfactory  results.  Moreover,  all  these  theories  gave  rise  to  fundamental 
difficulties  in  connection  with  quantization. 

On  the  other  hand,  the  theory  of  a  point  electron  leads  to  an  Infinite  value  of  the  mass 
as  r0  —  »  0  both  in  the  classical  and  quantum  forms  of  the  theory.  It  was  therefore  a  major 
achievement  of  renormalization  in  modern  quantum  field  theory  that  It  was  able  to 
separate,  from  the  infinite  interaction  energy,  finite  terms  associated  with  the  Lamb 
shift  of  the  atomic  levels  and  the  additional  magnetic  moment  of  the  electron. 

Modern  renormalization  theory  extends  to  the  problem  of  the  self-mass  and  charge 
of  the  electron.  It  has  been  found  that  when  the  electron-positron  vacuum  and  the  electro- 
magnetic vacuum  are  taken  into  account,  the  electromagnetic  mass  (22.33)  drops  out 
completely.  Therefore,  the  main  mass  of  the  electron  (the  so-called  "bare"  mass)  will 
not  be  associated  with  the  electromagnetic  field.  The  interaction  with  the  vacuum  (in  this 
case  the  main  contribution  is  obtained  when  the  electromagnetic  vacuum  and  the  electron- 
positron  vacuum  are  taken  into  account  simultaneously)  yields  an  additional  mass  which 
diverges  only  logarithmically 


Renormalization  is  faced  with  the  important  question  of  finding  the  value  c  max  at 
which  the  logarithmically  divergent  expression  should  be  cut  off.  This  problem  is  a 
theoretically  important  one,  although  in  practice  the  logarithmically  divergent  terms  may 
be  regarded  to  be  of  the  order  of  unity,  and  therefore  the  quantity  Am  will  remain  of  the 
order  m  bare  /1  37  over  a  comparatively  wide  range  of  values  «max. 
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In  exactly  the  same  way,  the  interaction  of  the  electron  with  the  electron-positron 
vacuum  (rather  than  with  the  electromagnetic  vacuum)  leads  to  a  decrease  of  its  "bare 
charge"  by  the  amount 

(22.35) 

v         ' 


-  . 

bare      w0c8 

The  field  corrections  to  the  mass  and  charge  of  the  electron  have  not  yet  been  experi- 
mentally separated,  and  this  important  subject  requires  further  investigation.6 

When  can  the  emission  of  photons  take  place  for  the  free  motion  of  an 

with  index  of  refraction  n  ^  \  (the  Cherenkov  effect)? 
Why  does  this  emission  become  impossible  in  a  vacuum  (n-=z  1)? 
Solution.    In  order  for  the  emission  to  be  possible,  it  is  necessary  that  the  laws  of 
conservation  of  energy 


and  momentum 


be   satisfied,  where  p  and  p1  are  the  initial  and  final  moments  of  the  electron  and  nkis 
the  photon  momentum. 

Squaring  these  relations  and  then  subtracting  one  from  the  other,  we  obtain  the  follow- 
ing expression  for  the  cosine  of  the  angle  of  photon  emission: 


Using  the  relation  for  the  index  of  refraction  n  of  the  medium 

•  ==««==  c»»jfe  =  ?*? 
n 

*  =  —  is  the  phase  velocity  of  light  in  the  medium),  we  obtain 


Radiation  can  take  place  when  p/i  >  1  (that  is,  when  the  electron  velocity  v  remains  less 
than  the  phase  velocity  of  light  in  vacuum,  but  becomes  greater  than  the  velocity  of  light 
in  the  medium  c  ^>vf  ^>  c1).  In  the  classical  case  (ft  =  0),  the  angle  at  which  the  photon  is 
emitted  satisfies  the  relation 


The    emission    of   radiation    in   vacuum    is  impossible  because  for  n  =  1  we  have 
cos  0  =  —  ,  and  the  velocity  v  cannot  become  greater  than  the  velocity  of  light  c. 


See  G  Wentzel,  Quantum  Theory  of  Fidd*,  New  York-  Interscience,  1949,  A  I. 
Akhiyezer  and  V  B.  Berestetskiy,  Element s  of  Quantum  Electrodynamics  (trans  ),  London: 
Oldboume  Press,  1962,  N  N  Bogolyubov  and  D  V  Shirkov,  Introduction  to  the  Theory  of 
Quantized  Fields  (trans  ),  New  York:  Interscience,  1959 


Chapter  23 

Theory  of  the  Helium   Atom   Neglecting 
Spin  States 


A,   BASIC  PRINCIPLES  OF  THE  THEORY  OF 
>  ^  MULTIELECTRON  ATOMS 

The  helium  atom  is  the  simplest  multielectron  atom;  it  con- 
sists of  two  electrons  moving  about  a  nucleus  of  charge  Z  =  2.  In 
spite  of  its  simplicity,  this  system  exhibits  several  important 
features  characteristic  of  the  many-body  problem  in  quantum 
mechanics. 

In  classical  theory  two  electrons  can  always  be  identified  by 
subscripts  1  and  2,  and  the  motion  of  each  electron  can  be  followed 
separately.  According  to  quantum  theory  two  electrons  can, 
in  practice,  be  distinguished  from  each  other  only  when  the  dis- 
tance between  them  is  large.  If  electrons  1  and  2  are  so  close 
to  each  other  that  there  are  points  in  space  where  their  wave 
functions  are  both  simultaneously  different  from  zero,  then, 
since  electrons  are  identical  particles,  we  shall  be  unable  to 
distinguish  whether  an  electron  occurring  at  the  point  is  electron  1 
or  2. 

This  indistinguishability  or  identity  of  particles  is  a  special 
feature  of  quantum  theory.  It  gives  rise  to  the  so-called  exchange 
forces,  which  have  no  classical  analog. 

In  multielectron  atoms,  spin  properties  are  very  important; 
these  properties  are  neglected  in  the  classical  and  Bohr  theories 
and  are  taken  into  account  only  in  quantum  mechanics.  In  fact, 
only  in  the  case  of  a  one-electron  atom  can  the  spin  corrections  be 
neglected  in  the  first  approximation.  This  explains  why  the  pre- 
dictions of  the  Bohr  theory  are  applicable  only  to  hydrogen-like 
atoms.  The  Bohr  theory  could  not  be  extended  to  atoms  with  two 
or  more  electrons,  since  it  could  not  account  for  either  the  ex- 
change forces  or  the  spin  states. 

In  order  to  investigate  the  basic  features  of  the  quantum  theory 
of  the  many-body  problem,  we  shall  consider  in  some  detail  the 
problem  of  helium- like  atoms  (for  example,  neutral  helium,  singly 
ionized  Li*,  doubly  ionized  Be++,  and  so  on). 
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B.-  SOLUTION  OF  THE  PROBLEM  OF  THE  HELIUM  ATOM 
VIA  METHODS  OF  PERTURBATION  THEORY 

To  start  with,  let  us  determine  the  physical  nature  of  the  ex- 
change forces  which  are  related  to  the  indistinguishability  of 
electrons.  We  shall  neglect  the  spin  properties  of  the  particles.1 

We  shall  assume  that  the  position  of 
the  first  and  second  electrons  is  given 
by  the  position  vectors  r\  and  r4,  re- 
spectively (their  origin  coincides  with 
the  stationary  nucleus;  see  Fig.  23V1).  For 
the  sake  of  brevity,  states  with  quantum 
numbers  (HI,  llt  MI)  and  (n*f  /2,  w2)  will 
be  denoted  by,  respectively,  n^  and  m  (in 
this  notation,  therefore,  n  stands  for  the 
whole  set  of  quantum  numbers).  If  there 
were  no  interaction  between  the  electrons, 
we  could  determine  the  motion  of  each 
electron  separately  from  the  Schrodinger 
equation 


Fig    2.3  1    The  helium  atom 


(23.1) 


where 


(23.2) 


/  takes  two  values:  /=!  in  the  case  of  the  first 
in  the  case  of  the  second  electron*   From  these 


and  the  subscript 
electron,  and  /  = 
equations  we  can  obtain  the  energy  values  Enj  (see  Chapter  13) 


(23.3) 


The  eigenfunctions  bnj   will  be  the  same  as  the  wave  functions  of  a 
hydrogen-like  atom;  they  will  satisfy  the  orthonormality  condition 


=  ^f.  (23.4) 

If  we  now  take  into  account  the  interaction  of  the  two  electrons 


V'=- 


(23.5) 


We  are  allowed  to  do  this  because  in  this  approximation  the  problem  has  a  solution 
in  the  form  of  a  product  of  two  functions,  depending  on  the  spatial  and  spin  coordinates, 
respectively  (see  Chapter  24). 
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their  motion  can  no  longer  be  regarded  as  independent.  The 
Hamiltonian  of  the  complete  system  will  be 

H  =  Hl  +  H2  +  Vf=H°  +  K'l  (23.6) 

and  to  describe  this  system  we  must  take  the  Schrbdinger  equation 
in  the  form 

r,)  =  0,  (23.7) 

where  E  is  the  total  energy,  and  <]>(/*!,  ra)  is  the  total  wave  function, 
which  depends  on  the  coordinates  of  both  electrons.  The  quantity 
'I^to.  ^)'Mri»  ri)  characterizes  the  probability  of  finding  the  first 
electron  at  position  r\  and  the  second  electron  at  position  r2. 
Therefore,  the  normalization  condition  for  tyfa,  r*)  takes  the  form 

J  t*(rlf  r,Wi.  r,)d^d^=lt  (23.8) 

where  the  integration  extends  over  the  coordinates  of  both  particles. 

Since  the  exact  solution  of  Eq.  (23.7)  entails  insurmountable 
difficulties,  we  shall  use  the  perturbation  theory3  developed  in 
Chapter  14,  assuming  that  the  mutual  interaction  of  the  electrons 
causes  only  a  small  change  in  their  individual  motions  in  the 
Coulomb  field  of  the  nucleus  (the  justification  for  this  approxima- 
tion will  be  examined  in  greater  detail  later). 

We  shall  first  consider  the  zeroth  approximation,  in  which  the 
perturbation  energy  V  can  be  neglected.  The  Schrbdinger  equation 
(23.7)  becomes 

(&-W)p(rl9  r,)  =  0.  (23.9) 

Since  the  zero-order  Hamiltonian  H°  can  be  resolved  into  a 
sum  of  two  Hamiltonians  Ht  +  H4,  each  of  which  depends  only  on 
a  single  variable  (either  rt  or  ra),  it  is  obvious  that  in  the  zeroth 
approximation  the  wave  function  may  be  written  in  the  form  (i|)°  =  #) 


r4).  (23.10) 

Substituting  (23.10)  into  (23.9)  and  using  (23.1),  we  get 


(£•  -  11°)  K  =  {£*  -  (Hi  +  H4)}  <k  (rO  4m,  (/•„)  = 

*«i  (/-i)Hrf«a  (r4)}  = 
^  (r,)  f^.  (r9)} 


The    problem   of  the  helium-like  atom   is   a    three-body   problem   and  cannot   be   solved 
exactly,  even  in  the  classical  approximation. 
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Hence,  the  energy  in  the  zeroth  approximation 

£°  =  EWl  +  £«t,  (23.11) 

where  E ni  and  £*2  are  the  energies  of  the  two  electrons  on  the  as- 
sumption of  no  mutual  interaction.  This  result  can  be  explained  as 
follows.  In  the  absence  of  the  perturbation  V\  the  motion  of  the 
electrons  is  determined  by  their  interaction  with  the  nucleus  ZeQ; 
that  is,  their  motion  is  completely  described  by  the  Schrbdinger 
equation  (23.1)  which  has  the  eigenvalues  Enj  [see  (23.3)]  and 
eigenfunctions  4>«; .  Since  one  of  the  electrons  is  in  a  state  n\  and 
the  second  in  a  state  /z*,  the  total  energy  of  the  system  is  Eni  -\-  En^ 
when  V'  =  Q. 

Since  the  two  electrons  move  independently  of  one  another,  the 
total  wave  function  will  be  the  product  of  the  corresponding  two 
independent  one-electron  wave  functions  obtained  in  the  one- 
electron  problem.  By  direct  substitution  into  Eq.  (23.9)  it  is, 
however,  easy  to  show  that,  in  addition  to  the  first  solution  (23.10), 
there  will  be  a  second  solution  (fy*  =  v)  corresponding  to  the  same 
energy  eigenvalue  (23.11): 

^^(r,)*,,,  to).  (23.12) 

This  solution  differs  from  u  by  a  permutation  of  the  electrons.  The 
first  electron  is  now  in  state  n*  and  the  second  electron  is  in 
state  HI. 

This  state  of  the  system  has,  therefore,  an  additional  degeneracy 
which  is  due  entirely  to  the  indistinguishability  of  the  electrons; 
this  is  known  as  exchange  degeneracy. 

If  both  electrons  are  in  identical  states  (wi=/za),  the  wave 
functions  u  and  v  are  identical  and  there  will  be  no  exchange  de- 
generacy since 

u  =  v  =  <(>„,  (r,)  ^(rt).  (23.12a) 

In  the  case  MI  ^  /i* ,  however,  functions  u  and  v  are  different,  and 
therefore  the  following  linear  combination  should  be  taken  as  the 
general  solution  fy°  of  the  Schrodinger  equation  (23.9): 

f  =  Ctu  +  CiV9  (23.13) 

where  Cv  and  C2  are  arbitrary  constant  coefficients,  which  are 
related  only  by  the  normalization  condition 


In  order  to  find  the  values  of  the  coefficients  Q  and  C2  and  the 
energy    E  of  the  perturbed  system  (the  system  with  V  taken  into 
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account),  according  to  the  perturbation  theory,  we  shall  look  for 
the  solutions  of  E  and  '!>  in  the  form: 


(23.14) 

We  use  a  first  approximation  of  the  Schrodinger  equation  (23.7), 
and  write  it  in  the  form 

(£°  —  H°)  <[/  =  -.(£'  —  I/')  (C!«  +  C2iF).  (23.15) 

From  Chapter  14,  we  recall  that  the  solution  of  the  homogeneous 
equation  for  the  unperturbed  problem  must  be  orthogonal  to  the 
right-hand  side  of  the  inhomogeneous  equation  for  the  first-order 
approximation  of  the  wave  function  [see  (14.13)].  Since  the  functions 
u  and  v  are  solutions  of  the  unperturbed  problem,  we  have 

u  *  (£'  —  V)  (Citi  +  C&)  dil*i  d^t  =  0,  (23.  16) 


V*  (E'  —  V)  (C,«  +  C2u)  rf3^  rf3A:a  =  0.  (23.17) 

In  (23.17),  let  us  substitute  r*  for  r,  and  /*i  for  rl9  The  function 
0  [see  (23.12)]  then  becomes  u  [see  (23.10)],  and  the  function  v 
becomes  //  ,  while  the  perturbation  energy  V  remains  unchanged 
since  |r,  —  ra|  =  |r,,  —  rv  \.  Thus  Eq.  (23.17)  takes  the  form 

f  «  >  (£'  —  K')  (C4«  +  C»  c/^2  c/U'i  =-  0.  (23.  17a) 

It  is,  therefore,  sufficient  to  consider  only  Eq.  (23.16)  since  the 
results  can  be  extended  to  Eq.  (23.17a)  by  substituting  Ct  ->  C2  and 
C4~>Ct. 

Let  us  substitute  into  Eq.  (23.16)  the  explicit  expressions 
(23.10)  and  (23.12)  for  u  and  v,  and  introduce  the  notation 

(23.18) 
(23.19) 
(23.20) 
(23.21) 

Here  Pnfo)  and  P««W  characterize  the  probability  density  distri- 
bution of  electrons  in  the  states  'h  and  n*,  respectively,  whereas 
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plt  (rt)  and  p^i  (/*«)  characterize  the  so-called  density  of  mixed  (or 
exchange)  states,3  when  each  electron  is  partially  instate  rc,  and 
partially  in  state  n8. 

The  orthonormality  condition  gives 


J  u*  u  d*Xt  d*x*  =  J  pn 
and 

J  «*y  d3*!  d3x2  =  j  Pli  (rO  d3*!  J  pM  (rO  d8*  =  0, 
and  therefore  we  can  reduce  (23.16)  to  the  form 

EC,  -  [Cfl  J  Jil^M(pL  ^  ^  + 


(23-22) 


The  first  integral  in  Eq.  (23.22)  is  simply  the  Coulomb  interaction 
of  two  "smeared-out"  electrons 

u      /,s  f    Pn  (ri)  pas  (ra)  jtv  MY  /OQ  Oo\ 

A  =eQ  \  —  .y  _^ri  —  "  ^1  "  ^2-  (Zo.ZoJ 

The  second  integral  gives  the  so-called  exchange  energy 

t 

A  =  el     "7,  -'          d'^  ***  (23.24) 


corresponding  to  the  interaction  of  two  electrons  when  each  is  in 
a  mixed  state  n\  and  na.  Unlike  the  Coulomb  energy  K,  the  ex- 
change energy  A  has  no  classical  analog;  it  is  essentially  a 
quantum- mechanical  concept. 

Using  Eqs.  (23.23)  and  (23.24),  we  obtain  instead  of  Eq.  (23.22) 
the  following  expression: 

C,(E'  —  K)  —  C2/1=0.  (23.25) 

From  Eq.  (23.25),  we  obtain  the  equation  corresponding  to  (23.17a) 
by  substituting  Ct  ->  Ct  and  C\  -*  C2 : 

C2  (£'  ~  /<)  —  dd  =  0.  (23.26) 


Those  densities  have  no  classical  analog 
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Equations  (23.25)  and  (23.26)  give 

1)  E'  =  K  +  A,  C,  =  C2>  (23.27) 
and 

2)  e  =  K  —  A,  Ci  =  — C2.  (23.28) 

Accordingly,   we   have   two   solutions    for   the   wave  function  [see 
(23.13)]  and  for  the  total  energy 

1)  symmetric 

^  =  C,(«-{-iO,  (23.29) 

£>  =  £»  +  /(-}- 4,  (23.30) 

and 

2)  antisymmetric 4 

Vl  =  Ci(u  —  v),  (23.31) 

EA  =  F  +  K  —  A.  (23.32) 

vVe    can   determine   the    coefficient   Ct  from  the  normalizatior 
condition  of  the  wave  functions  <]>s  and    y1: 


Hence,  2Cj  =  1    or  Ci  =  ---  .  Thus  for  <]>s  and  ^  we  finally  have 


t=~(tt  +  0),  (23.29a) 

*'l=^  («-*)-  (23.31a) 

When  both  electrons  are  in  the  same  quantum  state  (MI  —  /«;), 
the  functions  //  and  v  are  identical.  In  this  case  Eqs.  (23.16)  and 
(23.17)  reduce  to  the  same  equation 


0.  (23.33) 

From  this  it  is  readily  seen  that 

F  =  ff,  (23.34) 


We  recall  that  under  a  permutation  of  coordinates  (that  is,  when  r^   and  r2  are  inter- 
changed) the  function  u  and  u  transform  into  one  another    The  wave  function  0s*  does  not 
change   sign  as  a  result  of  this  operation  (symmetric  function),   whereas  the  function   ^ 
reverses  its  sign  (antisymmetric  function) 
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and  thus  the  exchange  energy  disappears  in  this  case.  For  the  wave 
function  we  obtain  a  unique  symmetric  solution 


'ii),  (23.35) 

with  the  corresponding  energy  of  the  system 

(23.36) 


Summarizing  the  results  of  this  section,  we  can  make  the  fol- 
lowing statement:  Application  of  the  perturbation  method  to  the  prob- 
lem of  the  helium  atom5  leads  to  two  types  of  solutions—  a  sym- 
metric or  antisymmetric  solution.  This  is  in  complete  agreement 
with  the  general  theory  of  systems  of  identical  particles  (see  below). 


C.  -COULOMB  INTERACTION  BETWEEN  ELECTRONS 

Let  us  find  an  expression  for  the  Coulomb  energy  of  two 
electrons  in  the  lowest  energy  state  (ni  =  n1=l).  In  this  case  the 
energy  and  the  wave  function  of  each  electron  are  given  by 


<23-3" 


_ 
where  °o  —  j^^r  is  the  radius  of  the  first  Bohr  orbit. 

The  Coulomb  energy  of  interaction  between  the  two  electrons  is 
*  =  J  *!  (rt)  «  (r,)  ^-i  tPxid'x*  (23.38) 

Here  I  rv  —  r2  1  =  Yr\  +  rl  —  2/ya  cos  0  ,  and  &  is  the  angle  between  the 
vectors  r\  and  r«.  To  integrate  (23.38),  let  us  choose  the  direction 
of  the  z  axis  along  the  direction  of  the  vector  r\  .  Substituting  the 
expression  (23.37)  for  the  wave  functions  into  Eq.  (23.38)  and  in- 
tegrating over  the  angles,  we  find6 


This  is  due  to  the  fact  that  the  perturbation  removes  the  degeneracy,  and  therefore 
the  coefficients  that  were  indeterminate  in  the  ^eroth  approximation  can  now  take  specific 
values  (See  also  Chapter  14,  the  Stark  effect  ) 

In   integrating  over  the  angle    $  (x  =•   cos  d),  we  used  the  relation 


, 
(continued) 
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drt.  (23.38a) 

* 

Next,  Integrating  over  r,  and  /•,,  we  finally  obtain 

K  =  4.M.  (23.39) 

o     a0 

Since  the  zero-order  energy  is 

E*  =  2£1  =  —  ^f  (23.40) 

flo 

the  total  energy  of  two  electrons  in  the  ground  state  is  equal  to 

B==£«  +  /C  =  -^-  +  |z^i.  (23.41) 

Let  us  now  find  the  ionization  energy  of  the  helium  atom,  that 
is,  the  energy  that  must  be  expended  in  order  to  remove  one 
electron  from  the  first  orbit.  For  a  singly  ionized  helium  atom 
(a  hydrogen-like  atom)  the  bonding  energy  between  the  electron  and 
the  nucleus  is  simply  El  [see  (23.37)].  The  ionization  energy  of  a 
helium-  like  atom  is  therefore 

£Jon._  £l-E  =  ^  (ll  -  1-  Z)  ,  (23.42) 

so  that  for  helium  (Z  =  2)  we  have 

£ior  =0.75r!-.  (23.43) 

"o 

According  to  experimental  data,  the  ionization  energy  of  helium 

is 

E^S  =0.9  ^  =  24.48  ev.  (23.43a) 

r  OQ 

The  discrepancy  between  the  theoretical  value  and  the  experimental 
data  is  a  result  of  the  fact  that  the  perturbation  energy  K=  4~ 

is  not  very  small  as  compared  with  the  zero-order  energy  |  E°|  =~? 
(their    ratio    is  ~l/3).    Perturbation   theory,  therefore,  gives  us 


Considering  that  tho  integrand  l/ri(ri)  i//f(/2)  is  symmetric  with  respect  to  the  variable 
T\  and  r2,  we  replace  (for  rj    "»"*  r2)  the  radius  rt  by  r2  and  vice  versa,  obtaining 


0       for    /j    >  r2 
This  expression  was  used  in  evaluating  the  integral  (23  38) 
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only  qualitative  aspects  of  the  problem.  The  accuracy  of  the  method 
is  not  very  great  because  K  and  £° have  the  same  order  of  magnitude. 

D.    THE  VARIATIONAL  METHOD 

The  variational  method  developed  by  Ritz,  Hylleraas  and  others 
was  first  successfully  used  to  find  the  ground -state  energy  of 
atomic  systems,  and  in  recent  years  ithasbeen  applied  in  collision 
theory. 

As  we  know,  the  average  energy  of  anatomic  system  is  given  by 


(23.44) 
If  the  wave  function  is  represented  as 

*  =  ££„<!*„,  (23.44a) 

where  the  coefficients  Cn  give  the  probability  of  an  electron  being 
in  state  n ,  the  average  value  of  the  energy  will  be  given  by  the 
relation  [see  (7.21)] 

!*£«•  (23.45) 

Replacing  each  eigenvalue  En  in  the  summation  by  the  lowest 
eigenvalue  £min  and  using  the  fact  that  for  normalized  functions 


we  find  that 

£™in<  f  <i»*H']»d3x;  (23.46) 

that  is,  the  lowest  value  of  the  integral  \  ty*W>(Px  can  be  used  to 

determine  the  upper  limit  of  the  ground- state  energy  of  the  system. 
It  was  found  that  the  variational  method  gives  very  good  results 
when  the  perturbation  energy  E'  is  of  the  same  order  as  the  energy 
E0  of  the  zero-order  approximation.  The  variational  method  can 
therefore  be  used  in  cases  where  perturbation  theory  gives  poor 
results  (for  example,  in  calculation  of  the  ground- state  energy  of 
the  helium  atom).  When  a  problem  is  solved  by  the  variational 
method,  both  the  additional  interaction  V  and  the  main  interaction 
are  treated  equally  In  the  Hamiltonian  II  of  Eq.  (23.7).  A  test 
function  <j»  then  depends  on  several  parameters  and  is  selected 
in  such  a  way  that  the  integral  can  be  calculated  exactly.  The 
energy  £  will  then  be  a  function  of  these  parameters  and  it  is 
obvious  that  the  minimum  value  of  this  function  will  be  close  to 
the  true  value  if  the  test  function  resembles  the  true  function. 
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The  most  difficult  part  of  the  method  lies  in  the  choice  of  the 
best  test  function.  All  the  available  information  on  the  properties 
of  the  system  must  be  used  in  making  this  choice.  It  is  impossible 
in  the  general  case  to  indicate  the  form  of  the  test  function,  and 
therefore  it  is  frequently  necessary  to  rely  on  physical  and 
mathematical  intuition.  Very  often,  a  test  function  is  similar  in 
form  to  the  solution  of  the  unperturbed  equation. 

We  shall  now  use  the  variational  method  to  calculate  the  ground- 
state  energy  of  the  helium  atom.  Our  procedure  will  be  based  on 
that  of  Hylleraas  (1927).  At  the  end  of  the  discussion  we  shall 
compare  both  perturbation  and  variational  methods. 

For  the  test  function,  Hylleraas  chose  the  ground-  state  function 
(23.27)  of  the  hydrogen  atom,  replacing  the  charge  Z  by  a  certain 
effective  charge  2'.  The  quantity  Z  is  the  unknown  parameter  which 
has  to  be  determined  from  the  variational  principle.  The  test 
function 


<23-47> 


will  obviously  be  normalized  to  unity,  just  like  (23.37),  since  its 
normalization  is  independent  of  the  value  of  Z'. 

The  Hamiltonian  H  in  (23.46)  must  include  both  the  zero-order 
approximation  Hamiltonian  H°  and  the  perturbation  potential  energy 
V  .  We  thus  obtain 


H  =  T,  +  V,  +  Ta  +  V,  +  V,  (23.48) 

where  T;  and  Vj(j=lt  2)  are  given  by  (23.2)  and  the  perturbation 
potential  energy  Vis  given  by  (23.5). 

Since  the  wave  functions  are  normalized  and  since  T!  +  Vi  does 
not  depend  on  ra  and  Ta-f-  Va  is  independent  of  the  coordinate  ri9 
the  average  value  of  the  Hamiltonian  is 


.  (23.49) 

where 


=  ST.  J  *i  to)  (T  v')3  **  to)  **•  <23-5°) 

(ri)2*d*xlt  (23.50a) 

V'  =  J  ft  (n)  «  fa)  |n.gra|  fxiffix*         /         (23.50b) 
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The  integral  (23.50b)  agrees  exactly  with  integral  (23.38);  therefore, 
setting  Z  =  Z',  we  obtain  in  accordance  with  (23.39) 

7'=|.^!L.  (23.51) 

The  quantity  fi  in  Eq.  (23.49)  represents  the  average  value  of 
the  kinetic  energy  of  a  hydrogen-  like  atom  with  atomic  number  Z' 
when  the  electron  is  in  the  lowest  state.  This  value  is  related  to 
the  total  energy  of  the  hydrogen-  like  atom  by  the  well-known  ex- 
pression 

T,  —  £1  =        .  '  <23-52> 


In  exactly  the  same  way,  replacing  Z'in  (23.50a)  by  Z,  we  can  ob- 
tain the  average  value  of  the  potential  energy  of  a  hydrogen-like 
atom,  since  tt  is  well  known  that  the  potential  energy  is  twice  the 
total  energy  (V  —  2£j  ).  Consequently, 

VFl  =  |j2B1  =  —  ^3.     /  (23.53) 

Z  a0 

It  follows  from  (23.49)  that  the  average  value  of  the  energy  is 
given  by  the  expression 

E  (Z')  =  4  (Z'a  —  2ZZ'  -f  4  Z'),  (23.  54) 

Ug  O 

which  is  a  function  of  the  parameter  Z'. 

Let  us  now  find  the  value  of  the  parameter  Z'  corresponding  to 
the  minimum  energy  of  the  system.  Differentiating  E  (Z')  with  re- 
spect to  Z'  and  setting  the  derivative  equal  to  zero,  we  find 


The  minimum  energy  of  electrons  in  the  helium  atom  is  therefore 


For  the  ionization  energy  we  have 


In  particular,  for  helium  (Z  =  2)  we  obtain 

5-.  (23.56) 
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This  result  is  considerably  closer  to  the  experimental  value  [see 
(23.43a)]  than  (23.43)  obtained  from  the  perturbation  theory. 
Hylleraas  later  improved  the  agreement  with  the  experiment  by 
using  several  variational  parameters.  The  result  (23.55)  for  £mm 
has  a  simple  physical  interpretation,  namely:  the  interaction  between 
the  electrons  results  in  a  screening  of  the  positive  nuclear  charge. 
The  variational  method  can  also  be  used  to  find  the  upper  energy 
limit  of  one  or  several  excited  states.  In  this  case  the  test  function 
has  to  be  chosen  in  such  a  way  that  it  is  orthogonal  to  the  wave 
functions  of  all  the  lower  states.  When  the  energy  levels  are 
arranged  in  order  of  increasing  magnitudes  (E0,  £1,  E*  .  .  .),  it  can 
easily  be  shown  that  since  <[»  is  orthogonal  to  functions  <I>o,  <N«  <N*  •  «t 
the  corresponding  expansion  coefficients^,  Clt  C2>.  .  .are  all  equal 
to  zero.  Therefore,  in  accordance  with  (23.45),  the  energy  is 


where  ft0  is  the  quantum  number  of  the  given  excited  state.  Using 
the    relation         |Cn|a=l,    we   find  that  the  minimum  value  of  this 


energy  corresponds  to  the  unknown  energy  Eno  of  the  excited  state. 
The  application  of  this  method  to  the  calculation  of  the  energies  of 
the  highest  excited  states  is  rather  difficult,  because  of  the  neces- 
sity of  introducing  a  large  number  of  additional  conditions  to  ensure 
that  the  wave  function  of  a  given  state  will  be  orthogonal  to  the 
wave  functions  of  the  lower  states. 


E.    DERIVATION  OF  THE  SCHRODINGER  EQUATION 
BY  THE  VARIATIONAL  METHOD 

We  shall  consider  one  of  the  most  general  forms  of  the  variational  problem,  when 
the  choice  of  the  test  wave  function  41  that  is  used  to  find  the  average  value  of  the 
Hamilton!  an 


=  f 


(23.58) 


describing   the  motion  of  a  single  particle  is  restricted  only  by  the  normalization  con- 
dition 


f  ty*fy  d*x 


(23.59) 


Upon  varying   E  with  respect  to  fy  and  using  the  hermiticity  of  the  operator  Hf  we  ob- 
tain 

8E  =  C  (&<|;*H<|;  +  UH*<|/*)  d*x  =  0.  (23.60) 
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Here  the  variations  8^  and  ty*  cannot  be  regarded  as  independent  since  they  are  con- 
nected by  the  normalization  condition  (23.59).  These  variations  can  be  made  independent 
by  varying  the  condition  (23.59) 


C  ty*<\>d3x+  f  a<|4*  (/•*  = 


Multiplying  this  equation  by  a  constant  Lagrangian  multiplier  X  f  which  is  chosen  in  such 
a  way  that  the  variations  are  now  independent,  we  add  the  resulting  equation  to  (23.60). 
Since  the  variations  ty  and  H*  are  now  arbitrary,  the  variational  principle  gives  auto- 
matically the  Schrodinger  equation  for  fy  and  ty*: 

(H  —  £)<]>  =  0,    (H*  —  £)^*  =  0,  (23.61) 

and  the  physical  meaning  of  the  parameter  A  becomes  clear:  it  is  the  negative  of  the 
energy  E  (X  =  —  E). 

Consequently,  the  variational  principle  and  the  normalization  condition  together  lead 
to  the  Schrbdinger  equation.  It  is  apparent  from  the  above  results  that  the  eigenvalues 
of  the  Schrodinger  equation  (23.61)  give  the  extremals  of  the  variational  integral.  A 
more  detailed  analysis  shows  that  these  extremals  are  minima  and  that  the  energy  of 
the  ground  state  corresponds  to  the  absolute  minimum—  the  smallest  possible  energy 
value.  In  order  to  calculate  excited  states,  it  is  necessary  to  impose  orthogonality 
conditions  on  the  wave  functions  of  the  lower  energy  state  (as  mentioned  above),  whereas 
orthogonality  follows  automatically  in  the  Schrbdinger  theory. 

F.    HARTREE-FOCK  METHOD  OF  SELF-CONSISTENT 

FIELDS 

We  have  considered  two  extreme  cases  of  the  variational  method. 
In  one  case  (the  Ritz-Hylleraas  method)  the  variation  of  the  wave 
function  amounted  to  a  determination  of  the  "best  values"  of 
parameters  in  a  specially  chosen  wave  function.  In  the  other  case, 
the  choice  j)f  the  \yave  functions  was  restricted  onlxbjj^ 
izattoiTccM^  Schrodinger  equation.  ~ThereT 

is  also  an  intermediate  case  in  which  the  wave  function  is  not 
specified,  but  is  assumed  to  be  a  product  of  one-electron  functions 
depending  on  the  coordinates  only.  The  specific  form  of  these 
functions  is  found  by  using  the  method  of  successive  approxi- 
mations to  solve  an  equation  derived  on  the  basis  of  the  variational 
principle. 

One  such  method  was  proposed  by  Hartree  (1928).  This  method, 
whose  interpretation  from  the  point  of  view  of  the  variational 
principle  was  given  later  by  Fock,  may  be  described  as  follows. 
Let  us  start  with  the  variational  principle  for  two  particles  in 
general  form7 

E  =     f  (rl9  ra)  H'|>  (rlf  r«)  d^x,  d*x*.  (23,62) 


As  an  additional  condition,  we  require  that  the  total  wave  function 
should  be  a  product  of  one-electron  functions 

).  (23.63) 


7This   integral  can  be  generalized  in  a   similar  fashion  to  the  case  of  three  or  more 
particles 
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It  is  also  necessary  to  allow  for  the  normalization  condition 

'*i^=l,  (23-64) 

which  can  be  written  for  each  particle  separately 


Substituting   the   test   function   (23.63)  into  (23.62)  and  varying  <|>, 
and  <|»9f  we  obtain 


(23.65) 


where  Hy  =  pJ-j-  +  Vy(ry)  is  the  Hamiltonian  describing  the 
motion  of  a  single  electron  (/=  1,  2),  and  —  °-  represents  the  in- 

teraction energy  between  the  electrons. 

From  the  normalization  condition  (23.64)  we  obtain  a  relation 
between  the  variations 

J  WtlM*  +  Wftifc  +  t?Wi<h  +  t?W*i«  W  dM8*.  =  o. 

Multiplying  this  equation  by  the  Lagrange  multiplier  A  =  —  £  and 
adding  it  to  (23.65),  we  can  select  A  so  that  all  the  variations 
8ti»  Wi  and  so  on,  will  be  independent.  Therefore,  we  obtain  the 
Hartree  equations 


l  Ii  +  (  fc'H^d^  +  (  W  -$-  h<Px*  -  fik  =  0, 

•'  •'  ria  / 

/  c  r       p*  \ 

(l  I,  -f  J  $f  I  U'h  A'i  +  J  tf  -^-  *td^,  -£*,  =  0. 


Analogous  equations  can  also  be  obtained  for  the  complex  conjugate 
functions.  We  multiply  the  first  equation  by  ty*  and  integrate  it 
over  the  whole  coordinate  space  of  the  second  particle;  in  a 
similar  fashion,  we  multiply  the  second  Hartree  equation  by  ty*  and 
integrate  it  over  the  coordinate  space  of  the  first  particle.  Adding 
the  two  resulting  equations  and  dividing  the  total  sum  by  2  we 
obtain  the  expression  for  the  energy 


=  ^  J 


'     *'**'*'  ~V    >>''>'•  (23.66a) 
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where  in  the  case  of  two  particles  /,  /=!,  2.    This  equation  can 
also  be  used  successfully  for  a  larger  number  of  particles. 

If  we  neglect  the  interaction  energy  (that  is,  if  we  set  the  terms 

containing  -^-  equal  to  zero)  and  use  the  relations 


and 


Hartree'  s    equation   can   be    resolved    into   a  system  of  two  inde- 
pendent equations 

(H,  -£,)<[,,  =  0 

describing  the  motion  of  each  particle  individually. 

In  problems  treated  by  the  Hartree  method,  the  field  in  which 
each  electron  moves  is  composed  of  an  external  field  (for  example, 
the  nuclear  field)  and  the  field  produced  by  all  the  other  electrons. 
This  method  is,  therefore,  known  as  the  self-consistent  field 
method. 

Fock  (1930)  generalized  the  Hartree  method  by  taking  into 
account  the  exchange  effects.  For  this  purpose  it  is  necessary 
that  the  test  function  in  the  initial  equation  (23.62)  be  consistent 
with  the  Paul!  exclusion  principle.  The  choice  of  test  functions 
is,  therefore,  further  restricted  by  the  requirement  of  anti- 
symmetry (antisymmetric  functions  satisfying  the  exclusion  prin- 
ciple will  be  considered  in  greater  detail  in  the  next  chapter). 

Hartree'  s  system  of  equations  (for  example,  for  an  electron  shell 
in  an  atom)  is  solved  by  the  method  of  successive  approximations. 
First  of  all,  the  wave  functions  in  the  zeroth  approximation  (that 
is,  neglecting  mutual  interaction  between  the  electrons)  are  deter- 
mined. These  wave  functions  together  with  the  interaction  potential 
between  the  electrons  are  then  used  to  obtain  the  first  approxi- 
mation equations.  The  first  approximation  wave  function  is  sub- 
stituted back  into  the  Hartree-  Fock  equations  to  obtain  the  next 
approximation  and  so  on.  This  process  is  continued  until  the 
solutions  obtained  in  successive  approximations  are  identical 
(within  the  desired  accuracy),  that  is,  until  the  solutions  are  self- 
consistent.  Hartree'  s  system  of  equations  can  be  solved  only  by 
means  of  numerical  methods  of  integration.  With  the  help  of 
modern  computers  it  is  now  possible  to  use  Hartree's  method  to 
determine  the  energies  and  the  wave  functions  for  both  light  and 
heavy  atoms. 

Another  approximation  which  can  be  used  in  treating  heavy 
atoms  is  the  Thomas-Fermi  statistical  method.  Although  this 
method  is  not  as  accurate  as  the  self-consistent  field  method, 
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it  is  comparatively  simple  and  predicts  many  properties  of  multi- 
electron  atoms.  This  method  will  be  used  in  our  subsequent  in- 
vestigations, and  we  shall  consider  it  in  Chapters  25  and  26  in  con- 
nection with  the  theory  of  the  periodic  system  of  elements. 

G.    INVESTIGATION  OF  THE  EXCHANGE  ENERGY 

Let  us  consider  in  somewhat  greater  detail  the  physical  mean- 
ing of  the  exchange  energy  (23.24),  which  represents  the  average 
value  of  the  Coulomb  interaction  between  two  electrons  that  are 
both  partly  in  state  n\  and  partly  in  state  tii.  In  accordance  with 
Eqs.  (23.30)  and  (23.32),  the  total  energy  of  the  system  is  re- 
lated to  the  Coulomb  energy  K  and  the  exchange  energy  A  by  the 
expression 

E  =  F  +  K±A,  (23.67) 

where  the  plus  sign  corresponds  to  <j>s,  and  the  minus  sign  to  <]>a. 
In  order  to  analyze  the  exchange  energy  in  greater  detail,  we  shall 
examine  the  behavior  of  the  system  in  time,  taking  into  account  the 
exchange  energy.  The  wave  functions  of  the  symmetric  and  anti- 
symmetric states  may  be  written  as 

f  (0  =  f  e"  *  £S  and    <l»a(0  =  *Y~  »  **'.  (23.68) 

Introducing  the  notation 

£±*=».     4  =  8,  (23'69) 

Eq.  (23.68)  may  be  represented  as 

<,'(/)='(„  +  «)«-'"'-"', 

V  (23.70) 

y(t)  =  -±-(U-v)e->»'+'». 

Let  us  consider  a  state  of  the  system  described  by  a  super- 
position of  the  solutions  <j»s(0  and  <|»a(/) 


r2,  0  =  ^(0  =  CY<0  +  CV(0-  (23.71) 

It  is  easy  to  show  that  the  function  W  (t)  represents  the  general 
solution  of  the  Schrodinger  equation  (23.7)  in  the  first-order 
perturbation  theory. 

We  shall  now  assume  that  at  the  initial  instant  of  time  (/  =  0) 
one  of  the  electrons  is  in  state  nl9  and  the  other  is  instate  n%. 
Then  the  function 

\V  (0)  =  -™-  {(C  s+  Ca)  u  +  (Cs  —  Ca)  v}  (23.72) 
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is  simply  equal  to  the  function  u.  It  follows  that 

-£=(CS+ <?)=!, and  Cs-Ca=:0, 
that  is, 

CS /~»a I  /OQ   7Q\ 

—  u    — ~777T-  1^0. /OJ 


According   to  Eqs.   (23.70),   (23.71)  and  (23.73),  at  a  certain  later 
time   t,  the  function  *F  (t)   becomes 


)  =  <?-""  {u  cos  S/  —  iv  sin  3/}  ==<rttu'  {Cuu  +  Cvv\,         (23.74) 
where 

(23.75) 


It    is    obvious    that   the   amplitudes    Cu  and    CV9  which  satisfy  the 
normalization  condition 


|  /"»      |12      I        1/^1? t  /OO    r7fi\ 

]  Ctf  I  -|-  |C.y|*=J,  (2o.7o) 

characterize  the  probability  of  a  system  being  in  states  described 
by  u  and  v,  respectively. 

Since  at  /  =  0  we  have  Cv  =  0  and  Cn  =  1 ,  the  system  is  initially 
in  a  state  represented  by  the  function  u.  However,  after  a  time 

t=™  (23.77) 

(so  that   8/  =  y),  it  can  be  seen  from  (23.75)  that  the  coefficients 
Cu  and  Cv,  will  have  the  values 


At  the  time  /  =  T,the  state  of  the  system  will  therefore  no  longer 
be  described  by  the  function  u,  but  instead  by  the  function  v.  This 
indicates  that  whereas  at  time  f  =  0  one  of  the  electrons  of  the 
system  is  in  state  n\  and  the  other  in  state  «2,  after  a  time  A/  =  t, 
the  first  electron  will  be  in  state  /i2  and  the  second  electron  in 
state  /ij.  The  time  T  (23.77)  during  which  the  " exchange"  of 
electronic  states  occurs  is  called  the  "exchange  time."  It  is  con- 
nected with  the  exchange  energy  A  by  the  simple  relationship 

—  »=•••  <23-78> 
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It  is  readily  seen  that  if  there  is  no  exchange  energy  (A=Q)  then 
T  =  00. 

To  conclude  this  section,  let  us  evaluate  the  exchange  energy 
when  the  first  electron  is  in  the  state  nt=ls,  and  the  second 
electron  is  in  the  state  n2  =  2s.  The  wave  function  tym  —  tyi  is  given 
by  (23.37)  and  the  wave  function  t««  =  1'«  can  be  found  from  (13.26) 
and  (13.28)  8 


For  the  exchange  energy,  instead  of  (23.38),  we  obtain 

)  fc(ra)  .|r|^ra|  ti  W  *i  (ri)  ^  A  (23.80) 


A  =  J 


Carrying  out  the  integration  in  the  same  way  as  was  done  for  the 
evaluation  of  energy,  we  obtain,  instead  of  (23.38a), 


6 

Evaluating  these  integrals  (which  is  a  relatively  simple  process) 
we  find  that 


(23.81) 

In  accordance  with  (23.78),  the  exchange  time  of  electrons  in  the 
Is  and  2s  states  in  the  helium  atom  (Z  =  2)  has  the  value 


The  exchange  time  in  the  case  where  one  of  the  electrons  is  in 
the  Is  state  and  the  other  in  the  10s  state  is  of  the  order  of  several 
years,  and  there  is  practically  no  exchange.  Consequently,  it  can 
be  seen  that  the  exchange  energy  plays  a  significant  part  only 
when  the  probability  densities  It/iJ*  of  different  states  mutually 
overlap  to  a  considerable  extent.  When  the  overlapping  of  the 
wave  functions  is  insignificant,  the  exchange  energy  A  is  very 
small.  This  situation  somewhat  resembles  the  transfer  of  energy 
from  one  coupled  pendulum  to  another.  It  is  well  known  that  an 
oscillating  pendulum  transfers  its  energy  to  another  pendulum, 


8See  also  Problem  13.1. 
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initially  at  rest,  and  its  amplitude  of  oscillations  after  a  certain 
time  interval  becomes  equal  to  zero.  The  exchange  time  of  the 
oscillatory  energy  depends  to  a  great  extent  on  the  relationship 
between  the  natural  frequencies  of  oscillation  of  the  two  pen- 
dulums; the  time  of  exchange  attains  a  maximum  when  the  two 
frequencies  coincide  (the  case  of  resonance).  It  must  be  em- 
phasized that  this  analogy  is  purely  formal  and  is  possible  only 
because  of  the  wave  properties  manifested  in  both  phenomena. 


Chapter  24 

Elementary  Theory  of  Multielectron  Atoms 
Including  Spin  States 

A.    SYMMETRIC  AND  ANTISYMMETRIC  STATES 

As  we  pointed  out  at  the  beginning  of  Chapter  23,  the  quantum- 
mechanical  theory  of  assemblies  of  identical  particles  contains 
several  features  that  have  no  classical  analog.  The  most  im- 
portant of  these  features  arise  from  the  fact  that  the  state  of  a 
system  is  unaffected  by  interchange  of  identical  particles.  This  is 
known  as  the  principle  of  indistinguishability. 

We  shall  consider  here  the  general  properties  of  a  wave  func- 
tion that  describes  a  system  composed  of  two  identical  particles. 
The  state  of  the  system  is  characterized  by  the  position  vector  r, 
three  spatial  quantum  numbers  ti  ,  /  and  m  ,  and  the  spin  quantum 
number  s.  For  the  sake  of  brevity,  we  let  n  denote  all  three  of  the 
quantum  numbers  /*,  /  and  m9  The  wave  function  of  the  system 
has  the  form 

^(/ZiVi;  n&r>),  (24.1) 

where  subscripts  1  and  2  refer  to  the  first  and  second  particles, 
respectively. 

Let  us  introduce  an  operator  p  which  permutes  the  position 
coordinates  r\  and  r^  or  the  quantum  numbers  nlt  si  and  «*,  &, 


PvIr("iVY,  'Wi^^faiVa;  n^r^  (24.2) 

It  is  easy  to  find  the  eigenvalues  of  this  operator 

(24.3) 


It   follows  from  (24.2)  that  two  successive  applications  of  P  lead 
to  the  initial  wave  function 

(24.4) 


THEORY  OF  MULTI  ELECTRON  ATOMS  INCLUDING  SPIN  STATES     379 

On  the  other  hand,  it  follows  from  (24.3)  that 

P2*F  (/i^ri;  n2s2ra)  =  W  (n&r^  /z2sar2).  (24.5) 

The  eigenvalues  of  the  permutation  operator  are  therefore 

X  =  dbl.  (24.6) 

This  result  means  that  interchanging  the  particles  leaves  the  wave 
function  unchanged  (symmetric  wave  functions,  with  A  =  1  ) 

Ws  (riiw,  /z*Si/-a)  =  iFs  (flis^;  nLs^)t  (24.7) 


or  causes  the  wave  function  to  reverse  its  sign  (antisymmetric 
wave  functions  with  X  =  —  n 

^aOWt;  /i«V«)  =  —  Vi(/i1s1ri;  HIS/-,).  (24.8) 

According  to  a  principle  of  quantum  mechanics,  a  set  of 
identical  particles  can  only  exist  in  states  with  a  definite  type  of 
symmetry,  A  state  is  either  symmetric  (described  by  a  symmetric 
wave  function)  or  antisymmetric  (described  by  an  antisymmetric 
wave  function).  Quantum  transitions  between  symmetric  and  anti- 
symmetric states  are  impossible;  states  with  different  types  of 
symmetry  do  not  mix.  This  suggests  that  there  are  two  kinds  of 
particles:  one  kind  described  by  symmetric  wave  functions, 
and  the  other  by  antisymmetric  wave  functions. 

B.    FERMI-DIRAC  AND  EINSTEIN-BOSE  STATISTICS  . 
THE  PAULI  EXCLUSION  PRINCIPLE 

Experimental  and  theoretical  investigations  of  the  properties 
of  systems  of  identical  particles  have  shown  that  there  are  two 
kinds  of  particles  with  fundamentally  different  statistical  prop- 
erties, and  that  this  difference  is  essentially  related  to  the  spin. 

/         1       3 

Particles    with   half-  integral   spin    (s  =  -^>  -^t  ...   in   units    of  the 

Planck  constant  /y  obey  Fermi-Dirac  statistics.  These  particles 
are  known  as  fermions  and  include  electrons,  protons,  neutrons, 
H  mesons  and  hyperons  (all  with  spin  Va).  On  the  other  hand,  par- 
ticles with  integral  spin  (s  =  0,  1,  ...)  obey  Einstein-Bose  statistics. 
These  particles  are  known  as  bosons  and  include  w  mesons,  K 
mesons  (both  with  spin  0),  and  photons  (spin  1). 

We  may  note  here,  without  going  into  a  detailed  analysis  of  the 
statistical  properties  of  aggregates  of  particles,  that  in  Einstein- 
Bose  statistics  an  unliminted  number  of  particles  can  occupy  each 
state.  On  the  contrary,  in  Fermi-Dirac  statistics  only  a  single 
particle  can  occupy  each  state  defined  by  four  quantum  numbers. 
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This  characteristic  feature  of  fermions  was  established  empirically 
by  Pauli  (1923)  before  the  discovery  of  quantum  mechanics  and 
quantum  statistics;  it  is  known  as  the  Pauli  exclusion  principle. 
In  order  to  establish  the  relationship  between  the  type  of  sym- 
metry of  a  state  and  the  kind  of  statistics  that  is  appropriate  for 
this  state,  let  us  consider  a  system  of  two  particles  which  have  a 
negligible  mutual  interaction.  The  wave  function  for  the  system 
can  be  written  in  the  form  of  the  product 

W  (/i^;  n&rj  =  ^  (rj  $„&  (r%),  (24.9) 

where  tyniSl  and  <ta*s  are  the  wave  functions  of  the  individual  parti- 
cles. The  solution  (24.9)  corresponds  to  a  state  in  which  the 
particles  with  positions  r\  and  ra  are  in  states  characterized  by  the 
quantum  numbers  /tiSi  and  flaSj,  respectively.  Because  of  the 
identity  of  the  particles,  the  state  of  the  system  is  not  changed 
when  the  particles  are  interchanged.  The  following  solution  is 
therefore  just  as  valid  as  (24.9): 

;  nlSir*)  =  ^  (r,)  ^  (/•„)  (24.10) 


(that  is,  particles  with  coordinates  r^  and  r±  are  in  the  states 
and  NISI  ,  respectively). 

Both  wave  functions  (24.9)  and  (24.10)  describe  the  same 
physical  state  of  the  system.  We  note  that  any  linear  combination 
of  these  functions  will  also  describe  the  same  physical  state.  One 
of  the  possible  linear  combinations  is  the  sum  of  the  solutions 
(24.9)  and  (24.10),  namely, 

*s  -  y-  {  *»,.,  to)  W,  (r,)  +  ^  (/-,)  ^  (r,)  }  (24.11) 

(we   have   assumed  that  the   functions   ty*^  and  ^^  are  mutually 
orthogonal  and  normalized  to  unity,  and  therefore  we  have  intro- 


duced the  normalization  factor   y^.    This  solution  is  symmetric 
with  respect  to  interchange  of  the  spatial  and  spin  coordinates 

(24.12) 


Another  possible   linear   combination   is   the   difference   between 
(24.9)  and  (24.10) 

wa  =  yf  {  *«i*i  to)  tyn^  (r,)  —  ^151  (r,)  ^3,3  (r,)  },          (24.  13) 

which  gives  an  antisymmetric  solution  since 

(24.14) 
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From  the  form  of  *FS  and  ^d  it  follows  directly  that  particles 
obeying  Fermi-Dirac  statistics  can  be  described  only  by  anti- 
symmetric solutions.  The  Pauli  exclusion  principle  (Fermi  sta- 
tistics) is  satisfied  only  by  a  solution  of  the  form  (24.13)  since 
when  both  particles  are  in  the  same  state  (that  is,  n{  =  n^^=n  and 
st  =  Si  =  s) ,  we  have 

W*  (nsr^  nsr*)  =  0,  but  llrs  (nsrv\  nsrj  ^  0.  (24. 15) 

On  the  other  hand,  ^a  does  not  satisfy  the  requirement  of  Einstein- 
Bose  statistics,  in  which  an  arbitrary  number  of  particles  can 
occupy  the  same  state. 

It  is  easy  to  generalize  the  antisymmetric  and  symmetric  wave 
functions  to  the  case  of  a  system  composed  of  N  noninteracting 
particles.  The  antisymmetric  solution  may  be  represented  by  the 
determinant 


_1 

YN\ 


(24.16) 


The  antisymmetric  nature  of  this  solution  is  obvious  since  inter- 
changing any  two  columns  changes  the  sign  of  the  determinant. 
Moreover,  it  is  obvious  that  the  function  (24.16)  satisfies  the  Pauli 
principle  because  if  any  of  the  sets  of  quantum  numbers  are  equal 
(for  example,  n1  =  /i2,  Si  =  s2)  the  determinant  vanishes. 

The  symmetric  wave  function  can  be  written  in  the  form  of  a 
sum  of  products 


WS  =  TT^f  1  Wi  to)  *«.'.  to  '  •  •     *«*^  (r^  + 

"•••}•  (24.17) 


C.    ADDITION  OF  ANGULAR  MOMENTA.      RUSSELL-SAUNDERS 
COUPLING,  CLEBSCH-GORDAN  COEFFICIENTS 

Let   us    consider  more  fully  the  antisymmetric  wave  function 
for  a  system  composed  of  two  particles 

V*  =  -^r  t  *»i'«  to)  fa*  to  -  *«,„  (r*)  ^a,2  (r,) }.  (24. 18) 


In  the  many- body  problem,  we  are  faced  with  the  question  of  how 
to    add    the    angular    momenta.     In  the   two-particle   case,  each 
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particle   is  characterized  by  orbital  angular  momentum  quantum 
numbers  /t  and  /2  ,  respectively,  for  which 

L!  =  tf/t  (/,+  !)  (24.19) 

and  spins  s4  and  s2,  for  which 

.  (24.20) 


There  are  two  different  ways  in  which  these  four  quantities  can  be 
added.  One  of  these  involves  adding  the  orbital  and  spin  angular 
momenta  separately  and  then  finding  their  total  sum 

L  =  L!-fL2,     S  =  S,  +  S2,    J=L  +  S  (24.21) 

This  type  of  coupling  of  the  particles  in  an  atom  is  called  LS 
coupling  or  Russell-Saunders  coupling.  It  corresponds  to  the  case 
in  which  the  total  orbital  angular  momentum  and  the  total  spin 
angular  momentum  are  conserved  separately.  This  situation  is 
encountered  most  often  in  light  elements. 

The  second  way  of  combining  the  angular  momenta  involves  the 
addition  of  the  orbital  and  spin  angular  momenta  of  each  electron 
individually  and  then  finding  the  total  angular  momentum  of  all  the 
particles 

J^LL  +  SL,     J^Li  +  S,;     J^  +  J,.  (24.22) 

This  type  of  coupling  is  called//  coupling  and  is  encountered  mainly 
in  heavy  elements.  It  corresponds  to  the  case  in  which  only  the 
total  angular  momentum  is  conserved,  and  therefore  it  is  im- 
portant when  there  is  a  strong  spin-orbit  interaction. 

We  shall  now  compare  the  order  of  magnitude  of  the  Coulomb 
and  spin-orbit  interactions  in  the  helium  atom.  The  Coulomb 
energy  [see  (23.23)]  has  the  form 


^el  ( 

J 


(24.23) 


(T  lies  approximately  between  0.1  and  1).  The  existence  of  LS 
coupling  depends  on  the  Coulomb  energy  since  the  orbital  and  spin 
angular  momenta  are  conserved  separately  in  this  interaction.  The 
spin-orbit  interaction  for  the  helium  atom 


=  L'  s    V  ~  RMW  (24.24) 


(a  =  VIM)  is  considerably  smaller  than  the  Coulomb  interaction,  and 
therefore  the  coupling  is  of  the  LS  or  Russell-Saunders  type.  It  can 


THE  MULTI  ELECTRON  ATOMS  INCLUDING  SPIN  STATES       383 

be  seen  from  (24.24)  that  the  order  of  magnitude  of  the  spin-orbit 
interaction  depends  strongly  on  the  nuclear  charge  z  (^Z4),  so 
that  for  large  Z  (heavy  elements)  the  value  of  Es  °-  may  be  quite 
large.  In  this  case  the  coupling  will  be  of  the  //  type. 

Let  us  consider  the  problem  of  the  addition  of  the  orbital  angular  momenta  of  two 
particles  and  find  the  function  YLM  (1,2)  which  is  an  eigenf  unction  of  the  operator  for  the 
z  component  of  the  total  angular  momentum  L  =  L!  +  L3 


L*  YLM  (W)  =  <L'*  +  L*)  YW  0.2)  =  *MYLM  (W).  (24-25> 

and  of  the  operator  for  the  square  of  the  total  angular  momentum 

L*yLM  (1,2)  =  {Lf  +  LI  +  (Llx  -  iLtj,)  (Ltjf  +  ilzy)  +  (Llx  +  tLly)  (L9je  -  iL,y)  + 

+  2^,1^}  YLM  (1,2)  =  W\YLM  (1,2);    l  =  L(L+\)  (24>26) 

As   we  shall  show,  the  eigenfunction  YLM^  (1,2)  can  be  written  as  a  linear  combination  of 
products 


.?)  -          *"i«»  >  «!'  W  >  V  W.  (24.27) 


where    the    wave    functions    X™1  (1)   and    Y%*  (2)  refer  to  the  first  and  second  particles, 
respectively,  and 

Lt,  >U!  (O-^^1  (1);     Lfc  y?!2  (2)-^maK^  (2); 
M  Y"1  0)  =  ^2^i  (/i  +  0  >T  (1);    LI  ^2  (2)  =  »2/3  (/j  +  1)  V£s  (2)          (24.28) 

The  numerical  coefficients  ^m  OTi   in  the  linear  superposition  (24.  27)  arc  called  the  vector 
summation  coefficients  of  Clebsch-Gordan  coefficients. 

For  simplicity,  we  shall  restrict  ourselves  to  the  case  /,  =  /,  =  1.  Substituting  (24.27) 
into  Eq.  (24.25) 


we  find  that  M  =  Wi  +  /"sand  ^n,/na  =  ^m,* 

Without  loss  of  generality  we  may  set  M  =  0;that  is,  we  make  the  z  axis  perpendic- 
ular to  the  total  angular  momentum.  Then  it  follows  from  (24.27)  that 

yU(W  =  *-.n  (1)  J71  (2)  +  ».Kr'  (1)  Y[  (2)  +  ftoKJ  (1)  yj  (2).  (24.30) 

Taking  into  account  the  equation  for  spherical  harmonics  [see  (11.88)] 

(Li;c±  fLjj)  ^  (1)  =  -  *  V(/i  +  1  -±  Wi)(/i  T  mi)  ^^'(l),  (24.31) 

and  using  Eq.  (24.26),  we  obtain 


L'KZO(1,2)  -  /i2  {*_,  [2K{  Kr1 
+  ^o  [4X?  KJ  +  2KJ  Kf1  +  2K;  l  Ki]  }  =  X  (^  /j  K,  »  +  ft,  Kf1  KJ  +  &0X{  K?),      (24.32) 

This  equation  is  equivalent  to  a  system  of  three  homogeneous  equations  for  the  unknown 
Clebsch-Gordan  coefficients 

A)*i  =0 

H-(2—  X)ft.1==0  (24.33) 
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This  system  has  a  nonv finishing  solution  if  its  determinant  is  equal  to  zero 

2       2  — X       0 


0       2  — X 


4-X       2 


=  0. 


(24.34) 


Calculating  this  determinant  and  subjecting  the  wave  function  YLJA  to  the  normalization 
condition 


that  is, 


fl>W 
J        LM 

*o  + 


we  obtain  three  possible  solutions: 


From  the  conditions  X  =  L  (L  +  1)  we  find  that  L  —  0;  that  is,  this  solution  corresponds  to 
the  case  in  which  the  vectors  fa  and  fa  are  antiparallel 


fa   f     i  /•• 


2)     X  = 


-, 


This   case  corresponds   to  the   addition  of  the   angular  momentum  vectors  at  an  angle 
of  60° 


3) 


This  solution  corresponds  to  parallel  fa  and  fa 

/,  f    f  A. 

We  conclude,  therefore,  that  two  vectors  fa  and  fa,  whose  absolute  values  are  integral 
quantum  numbers,  can  be  oriented  at  angles  such  that  the  vector  representing  their  sum 
L  is  also  characterized  by  integers.  For  fa  ^  /„  these  values  are 


,  fa  — i 


(24.35) 


(a  total  of  2/a  +  1  values). 

This  method  of  adding  the  angular  momenta  is  based  on  a  vector  model  of  the  atom. 
We  can  obtain  our  previous  results  by  setting  /1=/2=1  andAf  =  0.  It  is  interesting  to 
note  that  the  same  values  of  the  Clebsch-Gordan  coefficients  would  also  be  obtained  in 
adding  the  orbital  angular  momentum  to  a  spin  angular  momentum  equal  to  unity  (for 
example,  for  photons X  In  this  case,  however,  one  would  have  to  use  the  normalized  spin 
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function  instead  of  the  spherical  harmonics   Y%*.   It  is  easy  to  show  that  the  Clebsch- 
Gordan  coefficients  will  satisfy  the  following  conditions: 


In  general  the  Clebsch-Gordan  coefficients  bm±  will  depend  not  only  on  the  quantum 
number  w3  =  M  —  mit  but  also  on  L  and  Ai.  If  the  orbital  angular  momentum  of  the 
second  parade  is  /a  =  1,  then  adding  both  orbital  angular  moments  ,  we  obtain 


for  which  the  Clebsch-Gordan  coefficients  are  given  in  Table  24.1. 


D.     WAVE  FUNCTION  OF  THE  HELIUM  ATOM 
INCLUDING  SPIN 

We  shall  now  consider  more  fully  the  wave  function  of  the 
helium  atom,  for  which,  as  we  know,  the  interaction  between  the 
spins  and  the  orbital  angular  momenta  of  the  electrons  is  of  the 
Russell-Saunders  type.  Since,  therefore,  the  orbital  and  spin 
angular  momenta  are  added  independently  of  one  another,  the  wave 
function  can  be  written  as  a  product  of  two  parts,  one  of  which 
depends  on  the  spins  of  the  particles  and  the  other  on  their  co- 
ordinates. The  wave  function  must  be  an  antisymmetric  function 
with  four  quantum  numbers 


fa  /•*)  =  —  C(s2,  s,)*,^/-,,  ra)  = 
=  -C(slf  SfcH^ta,  rO.  (24.36) 


There  are  two  possible  cases:  either  the  function  is  symmetric 
with  respect  to  the  spins  and  antisymmetric  with  respect  to  the 
coordinates,  or  the  reverse  case  is  true.  We  have  therefore  the 
following  two  types  of  solution: 

„</•„  r,),  (24-37> 

i8(n,  /•«).  (24,38) 

We  recall  that  the  spatial  part  of  the  wave  function  has  already 
been  obtained  (see  Chapter  23).  For  n^  ^=n^t 

s  1 

'P/iirt.i  (f\t    /*a)  =  —7=  (U  ~T~  &0,  /9A  ^Q\ 

T/II"J.\  i»     «/        |^2  ^*.otf^ 
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where 


(n).  (24.41) 

Let  us  now  investigate  the  spin  part  of  the  wave  function.  It  has 
already  been  mentioned  that  in  Russell-Saunders  coupling  the  spin 
angular  momenta  are  added  independently  of  the  orbital  angular 
momenta.  The  law  of  conservation  of  spin,  therefore,  provides  a 
basis  for  the  construction  of  the  spin  wave  function  for  the  two- 
electron  system.  We  shall  take  the  spin  function  of  each  electron 
to  be  an  eigenfunction  of  the  operator  for  the  z  component  of  the 
spin 


,       23, 
and  also  of  the  operator  for  the  square  of  the  spin 


(24.42) 


(24.43) 


Here  we  write  2x2  Pauli  matrices  <y  (see  Chapter  16)  without 
primes 


Hi-?)- 

Thus    the  spin  function  C=(CM   of  a  single  particle  satisifies  two 


C 
equations 


(24'44) 


(24.45) 

Since  a*—  1  ,  and  so  on,  we  obtain  from  Eq.  (24.45)  X,2  =  3/4.  The 
matrix  equation  (24.44)  for  Xj  is  equivalent  to  a  system  of  two 
homogeneous  algebraic  equations 


0.  (24.46) 

from  which  it  follows  that  there  are  two  solutions  corresponding  to 
the  two  possible  orientations  of  the  spin  with  respect  to  the  z  axis 

1)         *!  =  '/„    *!=!,    C2  =  0.  (24.47) 
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Here  the  spin  is  parallel  to  the  z  axis.  The  wave  function  corre- 
sponding to  the  eigenvalues  l/2  is 

(24.48) 

2)  X1==  — '/„    d  =  Q,    ca=l.  (24.49) 

In  this  case  the  spin  is  antiparallel  to  the  z  axis.  The  correspond- 
ing wave  function  is 

/        ,  v        /m 

(24.50) 

In  Eqs.  (24.48)  and  (24.50)  the  value  of  the  z  component  of  the 
spin  is  indicated  in  the  parentheses  following  the  amplitude  C. 
It  is  easily  shown  that  both  spin  parts  of  the  wave  function  satisfy 
the  orthonormality  condition.  If,  in  accordance  with  the  definition, 
we  take  the  Hermitian  conjugate  spin  function  to  be  a  single-row 
matrix 


then  from  (24.48)  and  (24.50)  we  see  that 


and 


The  effect  of  Pauli  matrices  on  the  spin  function,  according  to  Eqs. 
(24.50),  (24.48)  and  (24.44),  is 


(24.51) 

We  shall  look  for  the  spin  function  C  of  the  two  electrons  of  the 
helium  atom  in  the  form  of  a  superposition 


'  \  r  I  '  \   i      r  I  '  \  r  I       l\   i 
j  C2  ^-j  -f  ajC,  ^J  C2  (  —  Tj 


(24.52) 


THE  MULTl  ELECTRON  ATOMS  INCLUDING  SPIN  STATES        389 

where  C,  [  db  -~\   and  C2  ^:±  ^-J  are  the  spin  functions  of  the  first  and 

second  electrons,  respectively,  while  a{9  a,,  a*  and  «?*  are  the 
Clebsch-Gordan  coefficients  that  have  to  be  determined.  Let  us  now 
attempt  to  select  the  coefficients  a^  of  the  function  (24.52)  in  such 
a  way  that  the  latter  will  be  an  eigenfunction  of  the  operator  for  the 
z  component  of  the  total  spin 

s,  =  s;  +  s;  =  \  w  +  a-3),  (24.53) 

and  also  of  the  operator  for  the  square  of  the  total  spin 


"2  -f  2  *'•  <J"}  = 


~ 


Here  the  primes  and  the  double  primes  on  the  Pauli  matrices 
indicate  that  these  matrices  act  on  the  spin  functions  of  the 
first  and  second  electrons,  respectively.  We  thus  have  two 
matrix  equations 

-W  +  aJ)C  =  ftX1Cf  (24.55) 


~  a'-*"  C  ==  /t%C-  (24.56) 


Substituting  the  total  spin  function  (24.52)  and  taking  into  account 
the  effect  of  the  Pauli  matrices  on  the  spin  functions  of  the  in- 
dividual electrons  [see  (24.51)],  we  find  from  (24.55) 


Equating  the  coefficients  at  the  same  spin  functions  on  both  sides 
of  the  equation,  we  obtain  the  equations  for  the  parameter  Xj  and 
the  coefficients^ 

M* — X,)  =  0,  a2X1  =  0)  a3X1  =  0,  a4  ( 1 -}- Xj)  =  0         (24.57) 
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Similarly,  using  the  relation 

(a'-  a")  C  =  aA  (  1)  C,  (1)  -  a,  [C,  (  j)  Cs  (-  1)  - 


(24.58) 

which  can  be  obtained  with  the  aid  of  (24.51),  we  obtain  from  (24.56) 

2  «j  =  X8a,  ,     2  «4  =  X8a4, 

aJ  =  (Xa-l)ali  a2  =  (X2~l)a3.  (24-59) 

As  usual  we  set  X2  =  S(,S+l)t  where  5  is  the  eigenvalue  of  the 
total  spin.  We  note  that  the  systems  of  equations  (24.57)  and 
(24.59)  must  be  satisfied  simultaneously.  From  these  equations 
we  find  that  the  possible  nonvanishing  solutions  have  eigenvalues 

X,  =  2,  with  X1  =  +  1,0,  —1        (triplet), 
X2^0,with  X1  =  0  (singlet). 

Consider  now  four  cases.  Case  1.  Total  spin  is  equal  to  unity  and 
is  directed  along  the  z  axis.  The  spins  of  both  particles  are  par- 
allel. The  corresponding  solution,  given  by  (24.52),  is  symmetric: 

(24-6°) 


Xj  =  +  1  ,     a,  =  a2  =  a4  =  0. 

Case  2.  Total  spin  is  equal  to  unity  and  is  antiparallel  to  the  z 
axis.  The  spins  of  both  particles  are  parallel.  The  corresponding 
symmetric  solution  is 

2L)  -  (Si  U  s>)  •  (24-61) 

a4=l, 


Case  3.  Total  spin  is  directed  perpendicular  to  the  z  axis  and  is 
equal  to  unity.  The  spins  of  both  particles  are  parallel.  The  cor- 
responding symmetric  solution  is 

c'  =T  ic'  (  f)  c*  (-  i  )  +  c' 
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Case  4.  Total  spin  is  equal  to  zero.  The  spins  of  the  particles  are 
antiparallel.  The  solution  is  antisymmetric: 


We  note  that  the  choice  of  the  nonvanishing  Clebsch-Gordan  co- 
efficients was  such  that  all  four  solutions  are  normalized  to  unity. 
Returning  once  again  to  the  spatial  part  of  the  wave  function  for 
the  helium  atom  and  using  the  fact  that  the  total  solution  must  be 
antisymmetric  we  have,  in  accordance  with  (24.37)  and  (24.39), 

UT«  =0  (SL  SjH-  (ri,  ra)  (24.64) 

(three  states)  and 

¥••'  —  Ca  (s,,  52)f  (r,,  r.)  (24.65) 

(one  state).    The  spin  functions  are  given  by  Eqs.  (24.60)-(24.63), 
and  the  spatial  part  of  the  wave  function  for  n{  ^  th  is 


In  the  case  where  both  electrons  are  in  the  same  state 
there  is  only  one  solution  with  a  symmetrical  spatial  part: 


(24.67) 


E.    PARAHELIUM  AND  ORTHOHELIUM 


We  have  obtained  the  wave  functions  that  describe  two  states. 
One  state  (parahelium)  is  characterized  by  antiparallel  spins  of  the 
electrons  [the  wave  function  (24.65),  which  is  symmetric  with 
respect  to  interchange  of  the  coordinates].  The  other  state  (ortho- 
helium)  corresponds  to  parallel  spins  [the  wave  function  (24.64), 
which  is  antisymmetric  with  respect  to  interchange  of  the  co- 
ordinates; see  Fig.  24.1].  A  very  interesting  property  of  these 
states  is  that  both  types  of  helium  are  stable;  in  other  words, 
parahelium  and  orthohelium  are  not  converted  into  one  another. 
We  can  convince  ourselves  that  both  systems  are  closed  by  direct 
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calculation.  The  matrix  element  of  the  dipole  moment  correspond- 
ing to  a  dipole  transition  from  orthohelium  to  parahelium 


(24.68) 


rs-a  =  J  f  s  (/VaKrj 
=  J  *'s(/-2,r,)  (O  +  /*)  <!>a 
=  -  J  fs  (/VriXn  +  r 
turns  out  to  be  equal  to  zero,  since 


'arahelium 


o 

Orthohelium 


(24.  69) 


[In  (24.68)  we  have  changed  the 
variables  of  integration  and  used 
the  symmetry  properties  of  the 
wave  functions.]  Dipole  trans  it  ions 
from  one  state  to  another  are 
therefore  forbidden. 

Mutual  conversion  of  these 
states  may,  however,  take  place 
under  the  action  of  other  parti- 

cles. For  example,  the  bombardment  of  orthohelium  by  electrons 
may  lead  to  the  replacement  of  an  ejected  electron  by  another  elec- 
tron with  opposite  spin,  and  thus  orthohelium  can  be  transformed 
into  parahelium,  and  vice  versa. 


Fig  24  1.  Electron  spin  orientation 
in  the  helium  atom 


F.  ENERGY  SPECTRUM  OF  THE  HELIUM  ATOM 

It  has  already  been  noted  that  the  total  orbital  angular  momentum 
L,  which  is  a  result  of  the  addition  of  the  orbital  angular  mo- 
menta /t  and  /.2of  two  electrons,  assumes  integral  values  (Russell- 
Saunders  coupling).  In  the  particular  case  where  tl  =  l2=l  (both 
electrons  are  in  the  p  state),  the  total  orbital  angular  momentum 
takes  the  values  L  =  2.  1,  0.  In  terms  of  the  vector  model,  these 
values  correspond  to  the  following  situations: 

1.  L  =  2.  The  angular  momenta  are  parallel; 


2.  L=\.   The  angular  momenta  are  at  an  angle  of  60°; 
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3.   £  =  0    The  angular  momenta  are  antiparallel; 
At  4  /,       i=/t-./8=o. 

In  the  general  case  where  /i^/2,  the  number  L  takes  all  possible 
integral  values 

L  =  /,  +  /„  /!  +  /,-!,  /i  -M,  -2,  ...  ,  /,-/,.  (24.70) 

In  denoting  the  energy  levels  for  multielectron  atoms,  we  follow 
the  same  rules  as  for  hydrogen-like  atoms,  except  that  the  states 
with  a  definite  value  of  the  orbital  angular  momentum  L  are  denoted 
by  capital  letters 

L  =  Q  S  state 

L=l  P  state 

L  =  2  D  state 

L  =  3  F  state,  and  so  on. 

Let  us  enumerate  the  energy  levels  of  the  helium  atom.  The  lowest 
level  corresponds  to  the  total  orbital  angular  momentum  L  =  Q  with 
both  electrons  in  the  s  state.  This  singlet  term  of  parahelium  is 
denoted  by 


and  corresponds  to  an  antiparallel  spin  orientation.  The  super- 
script at  the  upper  left-hand  side  of  the  letter  characterizing  a 
particular  term  indicates  the  number  of  states  (the  multiplicity 
of  states). 

The  next  term  corresponds  to  the  case  when  one  of  the  electrons 
is  in  the  Is  state  and  the  other  is  in  the  2s  state.  In  this  case,  both 
parahelium 

(ls2s)'S0 
and  orthohelium 


are  possible.  Orthohelium,  unlike  parahelium,  has  a  spin  angular 
momentum  and  exhibits  anomalous  Zeeman  splitting  in  a  magnetic 
field. 

We  note  that  the  (ls26')3S,  level  of  orthohelium  is  metastable, 
because  a  transition  to  the  lower  energy  state  belonging  to  para- 
helium is  forbidden  by  the  selection  rules.  The  levels  of  para- 
helium are  singlets  (spin  0),  and  those  of  orthohelium  are  triplets 
(spin  1). 
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The  splitting  can  be  easily  explained  in  terms  of  the  vector 
model  of  the  orbital  and  spin  angular  momenta.  According  to  the 
vector  model,  the  sum  of  the  two  vectors  L  and  S  (that  is,  the 

total  angular  momentum  vector) 
£ev  can  take  the  values 


24 

22 
155 
177 

'5  Pan 

ihelium     -g  Orthoh 
fy2fi)   'P^\    I/ 

—  dszsi  *s.  i  r 
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level 

\1*2S)    JJ; 


...  ,|L-S|.         (24.71) 

For  £>S,  the  total  number  of 
these  values  is 


2S+1, 


(24.72) 


whereas  for 


it  is 


2L+1. 


(24.73) 


Fig    24.2     Energy   level   diagram   for  the 

helium    atom    (the    splitting    of    the      P 

level  is  not  in  scale) 


It  follows  that  for  the  5  term 
(L  =  0),  the  total  number  of  pos- 
sible states  for  both  ortho- 
helium  and  parahelium  is  equal 
to  one.  Inexactly  the  same  way, 
the  s  term  of  the  hydrogen 
atom  (one  valence  electron)  is  a 
single,  and  the  number  of  states 

is  equal  to  two  (that  is,  the  levels  are  doublets)  only  for  p9  d  terms 
(for  which  />s). 

The  total  spin  S  of  helium  assumes  two  values:  5  =  0,1.  For 
S  — 0  the  number  of  states  (2S-J-  1)  is  equal  to  unity,  and  for  S  =  1 
it  is  equal  to  three  (25 -f- 1=3);  that  is,  the  state  is  a  triplet.  A 
general  diagram  of  the  energy  levels  of  the  helium  atom  is  given 
in  Fig.  24.2. 

For   elements  of  the  third  group  (s^^-andS  —  ~-\  the  states 

are  quartets  or  doublets.  Thus  the  number  of  valence  electrons 
(see  Chapter  25)  completely  determines  the  splitting  of  the  spectral 
lines. 

The  theory  of  multielectron  atoms  is  treated  in  several  books. l 

A  particle  is  In  the  Yukawa  potential  well 


^ee,  for  example,   D.  I.  Blokhmlsev,   Principles  of  Quantum  Mechanics  (trans.),   Allyn 
and  Bacon,   1964. 

2These  problems  refer  to  Chapters  23  and  24 


THE     MULTI  ELECTRON    ATOMS     INCLUDING    SPIN    STATES  395 

Choose  a  test  function  in  the  form  ^l  =  Aie~^ir  and  determine  the  upper  limit  of  the 
values  of  x  at  which  at  least  one  discrete  level  will  exist.  Using  the  obtained  solution, 
consider  the  special  case  of  a  Coulomb  potential  (g*  =  ??„  *  =  0).  Show  that  discrete 
levels  always  exist  for  the  Coulomb  potential.  Determine  the  lowest  energy  level  and  the 
corresponding  wave  function. 

Solution.  The  normalized  wave  function  is  of  the  form  [see  (23.47)]: 


1= 

J/7C 

The  average  values  of  the  kinetic  and  potential  energies  are  [see  (23.52),  (23.53)] 


.  - 

~  2/w0  '  I'^i  +  *)*     * 

The  appearance  of  the  first  discrete  level  is  possible  under  the  conditions 

E7=f+K  =  0,    -||i=0, 
from  which  we  find 

_  Wo£2 
max       W  '• 

Discrete  levels  exist  for7:<0.  This  happens  when*<ymax.  For  a  Coulomb  field 
x  =0,  and  therefore  this  condition  is  always  satisfied.  The  average  energy  in  the  Coulomb 
case  is 

fc-S-w 

Using  the  condition  that  Ei  must  be  minimum,  we  find 


where  o0=  -  ^ris  the  radius  of  the  first  Bohr  orbit.  The  expressions  for  the  wave  func- 
tion and  the  average  energy  will  be  exactly  the  same  as  those  yielded  by  the  Schrodinger 
equation  for  the  Is  level  of  the  hydrogen  atom  (see  Chapter  13). 

roblem  24.J)  Calculate  the  diamagnetic  susceptibility  of  the  ground  state  of  one 
g ram- atom" ' W  par ahelium ,  using  the  expression  (23.47)  obtained  for  the  wave  function  by 
the  variational  method. 

Solution*  In  the  ground  state  of  parahelium,  the  orbital  and  spin  angular  momenta  are 
equal  to  zero(/=0),  and  therefore  parahelium  must  be  diamagnetic  (see  Problem  20.  IX 

In  accordance  with  (20.48),  the  diamagnetic  susceptibility  per  gram-atom  is  given  by 
the  expression 

'—(S^^  (24.74) 

where  AT  is  Avogadro's  number  and 

Ff  +  r j~  =  ^  (n  +  r\)  |  <!/(/-!,  r,)  |2  rf\v,  (P r2.  (24.75) 

From  (23.47)  we  obtain  an  approximate  expression  for  ^  (rlf  r.) 


396  RELATIVIST1C     QUANTUM     MECHANICS 

where  for  helium 

r-7      5--7 

^  —  ^~~  ig—  16" 
Evaluating  the  integral  (24.75)  with  the  above  expression  for  <pt  we  obtain 


The  diamagnetic  susceptibility  is  therefore 

y.theor=__  1  67  -  10-«, 

which  is  in  fairly  good  agreement  with  the  experimental  value 

=—  1.90  •  10-«. 


Chapter  25 


Optical   Spectra  of  Alkali  Metals 


A.    GENERAL  DISCUSSION  OF  THE  STRUCTURE 
OF  COMPLEX  ATOMS 

It  has  already  been  pointed  out  that  the  position  of  an  atom  in 
the  periodic  table  of  elements  is  determined  by  the  atomic  number 
Z.  This  number  characterizes  the  nuclear  charge  in  units  of  e0; 
that  is,  it  is  an  integer  that  is  equal  to  the  number  of  protons  in 
the  nucleus  and  also  to  the  number  of  electrons  in  the  neutral 
atom.  Thus,  for  example,  sodium  has  11  electrons  (Z=ll)  and 
uranium  has  92  (Z  =  92). 

The  most  important  questions  in  the  theory  of  a  complex  atom 
concern  the  electron  density  distribution  and  the  energy  spectrum. 
In  studying  these  questions  it  is  necessary  to  consider  the  mutual 
interaction  between  all  the  electrons,  in  addition  to  the  attraction 
between  the  electrons  and  the  nucleus.  Owing  to  the  interelectronic 
interaction,  the  energy  values  of  the  electrons  will  be  smaller  (in 
absolute  value)  than  those  given  by  the  well-known  express  ion 


Just  as  in  the  hydrogen  atom,  each  electron  in  a  complex  atom 
is  characterized  by  four  quantum  numbers.  For  Russell-Saunders 
coupling  (the  case  where  the  spins  and  orbital  angular  momenta 
of  the  individual  electrons  in  a  given  shell  are  added  separately)  , 
these  quantum  numbers  are  as  follows: 

(1)  the  principal  quantum  number  n  =  1,  2,  3,  4,  .  .  .  , 

(2)  the  orbital  angular  momentum  quantum  number  /  =0,1,  2, 

....  (/i-l). 

(3)  the  magnetic  quantum  number  m  =  0,  ±  1,  .  .  .  ,  ±  /,  and 

(4)  the   spin   quantum   number  m  =  ±  1/2  (characterizing  the 
projection  of  the  spin  on  the  z  axis). 

In  the  case  of  //  coupling,  the  four  quantum  numbers  are  as 
follows: 

(1)  the  principal  quantum  number  n  , 

(2)  the  orbital  angular  momentum  quantum  number  /  , 

(  3)  the  total  angular  momentum  quantum  number  /  =  |  /  ±  f/«  I  ,  and 
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(4)  my  =  —  /,  — /-j-1  .  .  .  / — 1,  /  (characterizing  the  component 
of  the  total  angular  momentum  along  the  z  axis). 

As  we  have  already  said,  Russell-Saunders  coupling  (or  LS 
coupling)  is  characteristic  of  light  elements,  while  //  coupling  is 
characteristic  of  heavy  elements.  Both  types  of  coupling  give  the 
same  number  of  levels. 

A  group  of  closely  spaced  energy  levels,  which  is  separated 
from  other  energy  levels  by  an  appreciable  gap,  forms  a  shell 
made  up  of  several  subshells.  Just  as  for  the  hydrogen  atom,  the 
classification  of  the  shells  is  based,  as  a  rule,  on  the  principal 
quantum  number  n.  The  relationship  between  the  principal  quantum 
number  n  and  the  letters  which  are  used  to  denote  the  shells  in 
x-ray  spectroscopy  is  as  follows: 

n  Shell  symbol 

1  K 

2  L 

3  M 

4  N 

5  0 

6  P 
1  Q 


Within  the  shells,  electrons  with  values  of  the  orbital  quantum 
number  /  equal  to  0,  1,  2,  3,  .  .  .  form  the  s,  p,  d,  /,  .  .  .  sub- 
shells.  To  determine  how  the  shells  and  subshells  are  filled,  one 
should  take  into  account  the  Pauli  exclusion  principle,  according 
to  which  there  can  be  only  one  electron  in  each  quantum  state 
characterized  by  four  quantum  numbers.  In  a  state  with  fixed 
values  of  n,  /,  m,  there  can  be  at  most  two  electrons,  differing  in 
spin  direction.  Since  the  quantum  number  m,  which  varies  from 
-/to  +/,  can  take  21  +  1  values,  we  obtain  the  following  expression 
for  the  maximum  number  of  electrons  in  a  given  subshell: 


(25.1) 


The  same  value  for  N/  is  found  for  jj  coupling  For  given  values  of  the  three  quantum 
numbers  n,  I,  j,  the  fourth  quantum  number  nij  may  take  the  values  —  jt  ..  ,  ~-l/i,  l/a,  ,  J, 
corresponding  to  2j  +  1  states  Hence  we  obtain  the  same  value  for  N; 


where  ' 

;  =  N 
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It  follows  that  the  maximum  number  of  electrons  in  the  subshells 
s(i  =  0),  p(/  =  l),  c/(/  =  2)  and  f(l  =  3)  is  as  follows: 

Subshells  with  a  larger  value  of  /  are  not  encountered  in  unexcited 
atoms,  and  therefore  we  shall  not  discuss  them  here. 

We  can  now  find  the  maximum  number  of  electrons  which  can 
occupy  a  given  shell 

n-l 

Wrt  =  y   Wf  =  2(l  +  3  +  .. .  +  (2/i-—  l))  =  2/i  *  "*"2^~"1=2fta.    (25.2) 

1=0 

Thus  there  can  be  at  most  two  electrons  in  the  K  shell,  eight 
electrons  in  the  L  shell,  eighteen  electrons  in  the  M  shell,  32 
electrons  in  the  N  shell,  and  so  on.  It  should  be  mentioned  that 
these  relations  between  the  quantum  number  and  the  number  of 
states  in  the  various  shells  are  true,  generally  speaking,  only  for 
hydrogen- like  atoms. 

To  establish  the  order  in  which  the  shells  and  subshells  are 
filled  in  complex  atoms,  it  is  necessary  to  consider  the  mutual 
interaction  of  the  electrons.  As  we  have  emphasized  several 
times,  this  problem  cannot  be  solved  exactly.  The  application  of 
quantum  mechanics  to  multielectron  systems,  however,  has  led  to 
the  discovery  of  various  important  properties  of  such  systems, 
including,  in  particular,  the  exchange  interaction.  Moreover,  the 
development  of  approximation  methods  has  made  it  possible  to  ob- 
tain highly  satisfactory  quantitative  results  for  complex  atoms. 
As  indicated  in  Chapter  23,  the  simplest  approximation  methods 
are  the  variational  methods  developed  by  Ritz,  Hylleraas,  and 
others;  these  give  comparatively  good  results  for  the  light  atoms 
(up  to  potassium).  A  more  complete  analysis  of  the  structure  of 
an  atom  can  be  made  by  means  of  the  self-consistent  field  method 
developed  by  Hartree  and  Fock.  This  method  has  been  used  to 
determine  the  energy  and  the  electron  distribution  in  heavy  and 
light  atoms  (see  Chapter  23).  This  method  also  gives  the  shell 
structure  of  complex  atoms.2 

Unfortunately,  the  use  of  the  self -consistent  field  involves  some 
very  complicated  computations,  which  can  be  carried  out  only  by 
computers.  As  a  result,  the  eigenfunctions  characterizing  the 
electron  distribution  are  obtained  in  the  form  of  numerical  tables 
rather  than  analytical  expressions. 


2For  more  details  see  Hartree,  D    R    The  Calculation  of  Atomic  Structures,  New  York: 
John  Wiley  &  Sons,  Inc.,  1957. 
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B.   THE  THOMAS-  FERMI  STATISTICAL  METHOD 

Besides  the  approximation  methods,  there  are  statistical 
methods  that  have  been  introduced  on  the  basis  of  principles 
developed  by  Thomas  and  Fermi.  These  methods  apply  mainly  to 
heavy  atoms. 

In  the  statistical  treatment,  the  electrons  of  an  atom  are  re- 
garded as  a  degenerate  gas  with  T  =  0,  just  as  in  the  theory  of 
metals  (see  Chapter  6).  The  Thomas-Fermi  statistical  method  is, 
of  course,  less  accurate  than  the  Hartree-Fock  self  -consistent 
field  method,  because  many  features  of  the  behavior  of  the  in- 
dividual electrons  are  neglected.  In  spite  of  this  general  short- 
coming, the  Thomas-Fermi  method  provides  a  fairly  simple 
explanation  of  many  important  properties  which  are  characteristic 
for  the  average  behavior  of  electrons  in  an  atom. 

The  Thomas-Fermi  method  does  not  enable  us  to  find  the 
shell  structure  of  an  atom,  but  it  does  explain  several  important 
features  of  the  filling  of  the  electron  shells.  This  method  can  be 
used  for  calculating  the  total  bonding  energy  of  electrons  in  an 
atom  and  the  radii  of  atoms  and  ions.  It  can  also  be  used  to  deter- 
mine the  influence  of  screening  electron  shells  on  the  scattering 
of  fast  electrons,  bremsstrahlung,  and  the  creation  of  electron- 
positron  pairs  due  to  absorption  of  x-ray  photons  in  the  nuclear 
field. 

Presently  statistical  methods  are  being  successfully  used  to 
construct  a  theory  of  heavy  nuclei  and  a  theory  of  matter  at  high 
pressures  (for  example,  inside  stars). 

In  heavy  atoms  the  positively  charged  nucleus  is  surrounded  by  a  cloud  of  negatively 
charged  electrons,  which  partially  screen  the  electric  charge  of  the  nucleus.  The  po- 
tential of  an  ionized  atom  at  distances  greater  than  its  radius  is  given  in  the  first 
approximation  by  the  following  expression: 

tto^V-W",  (25.3) 


where  7.  is  the  atomic  number,  and  N  is  the  number  of  electrons.  For  a  neutral  atom 
Af=Z  and,  therefore,  <J>oo  =  0;  that  is,  the  electrons  completely  screen  the  charge 
of  the  nucleus. 

In  constructing  a  statistical  theory,  three  forms  of  energy  should  be  taken  into 
account. 

1.  The  electrostatic  energy  of  attraction  between  the  electrons  and  the  nucleus. 
This  energy  is  related  to  the  electron  density  po  (the  number  of  electrons  per  unit 
volume)  by  the  expression 


(25.4) 


where  e  —  —  e0  is  the  charge  of  the  electron,  and  $n  =  —  is  the  potential  produced  by 
the  nucleus. 
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2.  The  electrostatic  energy  of  repulsion  between  the  electrons 

^e-e  =-^y  ?  Po*e  ^a^,  (2Ma) 

where 


3.  The  kinetic  energy  of  the  electrons  in  the  atom.  Just  as  in  the  theory  of  solids  at 
absolute  zero,  the  average  kinetic  energy  of  an  individual  electron  is  related  to  the 
electron  density  PO  by  the  following  expression,  in  accordance  with  Eqs.  (6.33)  and 
(6.33a)3: 

rav  =  xp02/»,  (25.4b) 

where 

*  =  ra  £  <37t2)V'  =  TO  e*a°  (3*°)S/'  •  <25-s> 

Hence  the  kinetic  energy  of  the  electrons  is 

(25.6) 

The  total  energy  of  an  electron  gas  in  the  field  of  a  nucleus  is  equal  to  the  sum  of  the 
potential  energy,  which  consists  of  two  parts  [see  (25.4)  and  (25.4a)]f  and  the  kinetic 
energy  [see  (25.6)].  Thus  the  total  energy  is 

-.  j  P0v8^_!ojp"I>nl+V:  j  Saji^-w.     (25.7) 

The  density  of  the  electron  gas  must  satisfy  the  condition 

Po  </'*  =  #,  (25.8) 


where  N  is  the  number  of  electrons  in  the  atom, 

With  the   auxiliary  condition   (25.8),  the  variational  principle  can  be  formulated  as 
follows: 

a{£  +  *o<Mf}=0.  (25.9) 

Using  this  principle,  we  can  find  a  relationship  between  the  total  potential  $  =  <£n  +  $e 
and  the  density  of  electrons  p0 

Po  =  -r(2/Wof  o  (*  -  <T'o»  Va.  (25.  10) 
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where  the  Lagrangian  multiplier  <I>fl  .  which  plays  the  role  of  a  constant  potential,  must 
be  found  from  the  boundary  conditions.  In  the  derivation  of  (25,  10),  we  have  used  the 
relations 


(25.11) 


=  \  Q 
J 


fi^^ 


Substituting  Eq.  (25.10)  for  the  electron  density  into  Poisson's  equation  for  a  spheri- 
cally symmetric  electron  distribution 

va<fr  =  J.  ~  r$  =  4ne  Opo  (25.  12) 

and   recalling  that  $0  =  const,  we  obtain  the  Thomas-Fermi  equation  which  forms  the 
basis  of  the  statistical  model  of  the  atom 

7  S3  r  <*  -  *•)  =  -g§F  <2m»<'>v'  (*  -  *•>'''•  (25.13) 

When  a  specific  problem  is  investigated,  the  solution  of  Eq.  (25.13)  must  satisfy 
certaip  definite  boundary  conditions.  For  an  atom  the  boundary  conditions  can  be  given 
in  the  form 

$-$0  =  ?^     forr-^0  (25.14) 

and  r 

for  r  =  r,. 


Here    r0    is  determined  from  the  condition  that  the  electron  density  should  be  equal  to 
zero  at  /-  =  r0  >  that  Is,  p0  (r0)=  0.  Hence,  in  accordance  with  (25.10),  we  find 

(2 


With  the  help  of  Poisson's  equation  (25.12)  and  (25.  13),  the  condition  (25.8)  may  be 
written  as 


(2M7) 


It  follows  from  (25.16)  that  $0  =  0  and  ro  =  oo  for  a  neutral  atom  (N=Z).  Instead  of 
(25.17)  we  get,  therefore, 


This  condition  will  hold  if  the  second  boundary  condition  is  satisfied 

Hm  r$  =  0.  (25.18) 
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We  note  that  the  Thomas-Fermi  equation  (25,  13)  has  one  exact  solution 


as  can  be  easily  shown  by  substituting  (25.19)  into  (25.13). 

This  solution  for  a  neutral  atom  ($0  =  0)  will  satisfy  one  of  the  boundary  conditions 
for  r—  -co  [see  (25.18)].  The  second  boundary  condition  for  r-*0  [see  (25.  14)]f  however, 
will  not  be  satisfied. 

Unfortunately,  solutions  of  the  Thomas-Fermi  equation  which  satisfy  both  boundary 
conditions  cannot  be  expressed  in  analytic  form.4 

Substituting  (25.  14)  into  (25.  10)f  we  find  the  behavior  of  the  density  po  as  r  -»  0, 

Po  =  const  r"-8/a.  (25.20) 

The  solution  (25.19)  for  a  neutral  atom  gives  too  high  a  value  for  4>  asr—*co. 
The  more  exact  Hartree-Fock  method  shows  that  the  electron  density  will  decrease 
exponentially  as  r  —  ••  oo  . 

Since  we  are  Interested  only  in  the  basic  principles  of  the  statistical  method,  we  shall 
construct  an  approximate  statistical  theory  of  the  atom  with  the  help  of  the  variational 
method.  This  will  enable  us  to  formulate  a  solution  of  the  problem  in  analytic  form  with 
only  small  quantitative  deviations  which  are  of  no  importance  to  us. 


C.   SOLUTION  OF  THE  THOMAS- FERMI  PROBLEM 
BY  THE  RITZ  VARIATIONAL  METHOD 

Solving  a  problem  by  the  Ritz  variational  method,  one  can  propose  any  number  of 
test  functions  which  depend  on  the  different  values  of  the  variational  parameter  X. 

We  shall  choose  the  test  function  on  the  basis  of  the  following  considerations.  The 
function  should  agree  roughly  with  the  solution  of  the  Thomas-Fermi  equation  for 


4We  may  note  that  the  numerical  integration  of  this  equation  presents  two  advantages 
over  the  numerical  integration  of  the  Hartree-Fock  equations  First,  the  Thomas-Fermi 
equation  is  much  simpler  than  the  Hartree-Fock  equations  Second,  this  equation  and  the 
boundary  conditions  (for  example,  $  =  0  for  a  neutral  atom  with  Z  -  N)  can  be  transformed 
into  a  universal  form  which  is  independent  of  Z  To  do  this,  we  must  replace  $(r)  by  a 
new  function 


where 

*   =    a  '  °   =   C"  V128Z, 

With  the  function  f(x),  Eq.  (25.13)  becomes 

V*   ~f   ~-   f/2>  (25  13a) 

and  the  boundary  conditions  (25  14)  and  (25  18)  become 

/"(*)    ^    1       for      x  -»  0  ,  f(x)   =    0      for      x-»00.  (25  14a) 

These  equations  are  of  a  universal  character,  that  is,  they  do  not  necessarily  depend  on 
the  quantity  Z.  Therefore,  after  numerically  integrating  the  Thomas-Fermi  equation,  we 
can  change  the  variable  (which  depends  on  Z)  and  use  the  equation  to  investigate  any 
heavy  atom.  This  cannot  be  done  with  the  Hartree-Fock  equation. 
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r  -»  0  (this  region  is  the  most  important  with  regard  to  the  solution  of  the  problem  as  a 
whole),  and  it  should  have  a  comparatively  simple  form,  so  that  it  can  be  exactly  in- 
tegrated in  the  calculation  of  the  total  energy.  One  test  function  which  satisfied  these 
requirements  is 


v         16*ra/* 
This  function  is  already  normalized  to  the  total  number  of  electrons 

^  Po  d».v=  -5L-1  IJ  yf  e-  V\rdr  =  N,  (25.21a) 


and  therefore  the  auxiliary  condition  (25.8)  will  be  satisfied  automatically. 

For  r-*  0,  the  test  function  (25.21)  changes  in  the  same  way  (p0  ~r~8/2)  as  the 
solution  of  the  Thomas-Fermi  equation  [see  (25.20)].  As  we  shall  see  later,  tids  ex- 
plains the  good  quantitative  agreement  between  results  obtained  by  means  of  the  test 
function  (25.21)  and  those  obtained  by  means  of  the  potential  satisfying  the  Thomas- 
Fermi  equation. 

The  potential  due  to  the  electrons  in  the  atom  is 

<I>e  -  ~  -**  (  1  -  e~  ^r  -  lA7  e~  /x7).  (25.22) 

This  can  be  easily  shown  by  substituting  Eqs.  (25.21)  and  (25.22)  for    p0   and  $e  into  the 
equation 


Using  also  the  expression  $n  =  —  ,  we  can  see  that  the  total  potential  satisfies  the 
boundary  condition  (25.15)  for   r  =  ro-*oo,   when  the  charge  density  vanishes  together 


with  the  exponential  term    e~ 

We   can  also  find  an  expression  for  the  kinetic  energy  T  in  terms  of  the  variational 
parameter  X.   From  Eqs.  (25.6)  and  (25.21)  we  have 


We  have  the  following  expressions  for  the  potential  energy  due  to  the  interaction 
between  the  nucleus  and  the  electrons  [see  (25.4)]  and  for  the  potential  due  to  the  mutual 
interaction  of  the  electrons  [see  (25B4a)]: 


(25.24) 


-6--  (25.25) 
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Adding  Eqs.  (25.23)-(25.25),  we  find  that  the  total  energy  (25.7)  of  the  electron  cloud  Is 

£  =  /tta-£A,  (25.26) 

where 


The  varlational  parameter  X  ,  corresponding  to  the  reciprocal  of  the  effective  radius 
of  the   atom,  can  be  found  from  the  condition  for  the  minimum  of  the  total  energy  /:  of 

-\  rp 

the  atom,  that  is,  ^r  =  0.  Hence  we  find 


I  __    9    /3ny/s   _/V_j 
T~~100\2~J 


(25.28) 


9  \37c 


For  a  neutral  atom  (N  =  Z  ),  we  have 


a?  i?  (_2  y/-  £j  ^7/ 

9  64  \3ic/       «0 


=  „_  0  758  < 

*  <*  o 


(25.29) 


(25.30) 


It   is   worth  noting  that  numerical  integration  of  the  Thomas-Fermi  equation  leads  to  a 
value  for  the  energy  of  the  atom  which  is  very  close  to  that  given  by  (25.30) 


E  T-F    =  —0,709 . . .—  Z7/s  =  —  20.94  Z?/3  ev. 

«0 


(25.30a) 


The  expression  (25.30a)  (with  the  sign  reversed)  represents  the  total  bonding  energy 
(or  ionization  energy)  W  of  a  neutral  atom,  that  is,  the  energy  required  to  remove  all 
the  electrons  from  the  atom. 

The  theoretical  values  obtained  for  IF  from  (25.30a)  are  quite  reasonable  even  in  the 
case  of  the  hydrogen  atom,  but  they  are  somewhat  higher  than  the  corresponding  experi- 
mental values.  The  relative  error  decreases  with  increasing  Z  (see  Table  25.  IX 


Table  25.1 

Theoretical  and  experimental  values  of  the  total  ionization 
energy  W  of  atoms  (in  units  of  eQ*la0) 


Element 

w  theor 

w  exper 

H 
Li 
Na 
Hg 

0.769 
9.982 
206.9 
21207 

0.5 
7.5 
162 
18130 
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Concluding  this  section,  let  us  compare  the  curves  which  are  obtained  for  the  charge 
distribution  in  neutral  argon  (Z—18)  on  the  basis  of  the  Thomas-Fermi  statistical 
theory  and  on  the  basis  of  Hartree's  method  of  the  self-consistent  field  (see  Fig.  25.1). 
It  can  be  seen  from  the  graph  that  the  curve  of  ?0  computed  according  to  the  Hartree 
method  (Curve  B)  has  characteristic  maxima  and  minima  corresponding  to  the  electron 
shells,  whereas  the  curve  calculated  according  to  the  Thomas-Fermi  statistical  theory 
(Curve  A)  describes  only  the  average  behavior  of  the  electron  density,  and  therefore  has 
no  relative  maxima.  For  large  values  of  r  there  is  also  a  marked  lack  of  agreement 
between  the  two  curves:  the  Hartree-Fock  method  gives  functions  which  decrease 
exponentially  with  increasing  r ,  whereas  in  the  statistical  theory  the  decrease  is  pro- 
portional to  r'4  [see  (25.19)].s 


40 
20 


s 


X; 


3 
r/a0 


Fig  25.1.  Comparison  of  the  electron  density  distri- 
butions in  argon  (Z  =  18)  obtained  on  the  basis  of  the 
Thomas-Fermi  method  (Curve  A)  and  the  Hartree-Fock 
method  (Curve  B)  A  pseudologanthmic  scale  is  used 
for  the  ordmate  axis  (we  have  plotted  the  quantity 
&n(\  \  Da0),  where  D  -  Arrr2 p0)  Therefore,  the  graph 
will  be  linear  for  small  r>aQ,  and  logarithmic  for  large 
D(IQ  The  choice  of  this  scale  enables  us  to  follow 
the  variation  of  /QQ  at  large  values  of  r,  as  well  as  for 
r  <  a0 


D.  ENERGY  LEVELS  OF  ALKALI  METAL  ATOMS 

In  studying  spectral  lines  in  complex  atoms,  it  is  necessary  to 
distinguish  between  outer  and  inner  shells. 

In  the  hydrogen  atom  there  is  only  an  outer  shell  (the  K  shell), 
which  contains  one  electron.  In  helium  (Z  =  2),  the  /<"  shell  is 
completely  filled  (a  noble  or  inert  gas).  In  litnium  (an  alkali  metal 
in  Group  I  of  the  periodic  table,  Z  =  3),  the  inner  shell  (K  shell)  is 
completely  filled,  but  the  outer  L  shell  has  only  one  electron. 
The  filling  of  the  L  shell  is  completed  in  Ne(Z  — 10).  In  sodium 
(an  alkali  metal,  (Z  =  11) ,  the  inner  K  and  L  shells  are  completely 


A  detailed  exposition  of  the  statistical  theory  of  an  atom  is  given  m  P.   Gombas,  Die 
The  one  des  Atoms  und  ihre  Anwendungen,  Vienna*  Springer- Verlag,  1949. 
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filled,  but  there  is  one  electron  in  the  outer  M  shell.  The  filling  of 
the  shells  in  these  atoms  is  illustrated  in  Fig.  25.2. 

It  should  be  noted  that  the  bonding  energy  of  an  electron  in  an 
inner  shell  is  much  greater  that  that  of  an  electron  in  an  outer 
shell.  An  indication  of  this  is  provided  by  the  ionization  energy, 
which  is  more  than  20  ev  for  inert  gases,  whereas  for  alkali  metals 
it  is  only  slightly  more  than  5  ev.  The  removal  of  the  first  valence 
electron  in  lithium  requires  the  expenditure  of  5.39  ev,  but  re- 
moval of  the  second  and  third  electrons  from  the  inner  shells  re- 
quires the  expenditure  of  76  and  122  ev,  respectively. 


Fig.  25.2.  Diagram  of  the  filling  of  electron 
shells  in  different  atoms.  On  the  right,  atoms 
with  completely  filled  shells  (inert  gases)  The 
dark  spots  represent  electrons  and  the  light 
spots  (with  a  plus  sign  in  the  middle)  represent 
nuclei 


The  atoms  in  Group  I  of  the  periodic  table  (Li,  Na,  K,  Rb,  Cs, 
etc.)  are  known  as  alkali  metals;  each  has  an  outer  shell  containing 
one  electron,  just  like  the  hydrogen  atom.  Their  optical  and 
chemical  properties,  therefore,  should  be  essentially  the  same  as 
those  of  the  hydrogen  atom  (for  example,  all  of  these  elements  are 
monovalent  and  they  all  exhibit  doublet  splitting  of  the  spectral 
terms). 

The  optical  spectrum  is  caused  by  the  transition  of  a  valence 
electron  (that  is,  an  electron  in  the  outer  shell)  from  an  excited 
state  to  a  lower  state.  The  excitation  of  electrons  in  the  inner 
shells  generally  requires  considerably  more  energy  than  does  that 
of  electrons  in  the  outer  shells ;  therefore,  downward  transitions  from 
excited  states  to  the  ground  states  of  the  inner  shell  are  accompanied 
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by  the  emission  of  x-rays  (see  Chapter  26).  The  atomic  nucleus 
and  the  electrons  in  the  inner  shells  together  form  what  is  known 
as  the  atomic  core.  Thus  the  charge  of  the  atomic  core  is  equal 
to  Zn=:Z  —  N,  where  N  is  the  number  of  electrons  in  the  inner 
shells.  For  alkali  metals  (Li,  Na,  etc.)  A/  =  Z—1,  and  thus  the 
charge  of  the  atomic  core  is  equal  to  unity  (Za=  1).  Therefore, 
the  main  part  of  the  potential  energy  which  retains  the  outer 
electron  in  the  metal  is  the  same  as  for  hydrogen,  namely, 


K0  =  -z«  =  -7'  (25'31) 

The  analysis  of  the  spectra  of  alkali  metals  can  thus  be  based  on 
the  corresponding  expression  for  the  energy  which  was  obtained 
in  Chapter  13 

£S  =  -^.  (25-32) 

Similarly,  we  can  use  the  hydrogen  wave  functions  as  the  zero- 
order  approximation  of  the  wave  functions 

*°  =*«/«•  (25.33) 

In  alkali  metals,  however,  we  cannot  confine  ourselves  to  the 
Coulomb  energy  in  treating  the  interaction  between  the  valence 
electron  and  the  atomic  core,  and  we  must  also  take  into  account 
the  polarization  forces  and  the  "smearing*  '  of  the  atomic  core 
over  a  finite  volume.  This  yields  various  corrections  and  removes 
the  /  degeneracy  which  occurs  in  hydrogen. 

In  Bohr's  semiclassical  theory,  the  orbits  of  valence  electrons 
were  rigorously  divided  into  orbits  which  penetrate  the  atomic 
core,  and  those  which  do  not.  In  the  case  of  "nonpenetrating" 
orbits  (orbits  with  nearly  circular  trajectories)  we  need  to  con- 
sider only  the  polarization  forces,  since  the  potential  outside  the 
atomic  core  (that  is,  outside  the  inner  shells)  is  completely 
independent  of  the  radial  distribution  of  a  spherically  symmetric 
charge.  The  radial  distribution  is  very  important  only  for  c  'pen- 
etrating" orbits  (elongated  ellipses)  (see  Fig.  25.3). 

In  quantum  theory  the  concept  of  a  trajectory  is  no  longer 
meaningful.  In  order  to  classify  orbits  as  "nonpenetrating"  or 
"penetrating,"  we  must  introduce  a  new  convention;  namely,  if 
the  wave  function  describing  the  behavior  of  the  valence  electron 
can  be  set  equal  to  zero  inside  the  atomic  core,  the  orbit  is 
"nonpenetrating,"  and  if  on  the  contrary  the  wave  function  can- 
not be  set  equal  to  zero  inside  the  atomic  core,  the  orbit  is 
*  'penetrating." 
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It  should  be  noted  in  this  connection  that  the  s  orbit  of  a  complex 
atom  is  always  penetrating,  since  its  wave  function  differs  from 
zero  inside  the  atomic  core  and,  indeed,  even  inside  the  nucleus 


(25.34) 


We  shall  now  calculate  the  polarization  forces  that  arise  between 
an  outer  electron  and  the  atomic  core.  The  outer  electron  will 
obviously  repel  electrons  in  the  inner  shells  and  attract  the  nucleus. 
As  a  consequence,  the  atomic  core  is  polarized,  and  a  polarization 
force  arises  between  it  and  the  outer  electron 


l 


1 


2*5(2— l).v 


(25.35) 


represents   the   polarization  of  the 


Nonpenetrating 
orbit 


The    quantity  e()  (Z  —  1 )  x  = 
atomic  core. 

If  we  regard  the  atomic  core  as 
an  elastic  dipole,  we  can  set 

p  =  p§,  (25.36) 

where   p   is  the  polarizability  of  the 
atom,  and 


&==';  (25.37) 

Fig.  25  3.  *Nonpenetrating"  and  "pene- 
trating" orbits  in  alkali  metal  atoms 

is  the  electric  field  produced  by  the 

outer   electron   at  the  center  of  the 

atomic  core.   Thus  we  can  obtain  the  following  expression  for  the 

potential  energy  associated  with  the  polarization6: 


— S 


(25.38) 


Since  the  polarization  can  be  regarded  as  a  perturbation  in  this 
problem,  we  have  the  following  expression  for  the  polarization 
energy: 


Ep  =  J 


=  -  ^  (1). 


(25.39) 


The  coefficient  of  polarization  P>  is  usually  determined  from  semiempmcal  formulas 
numerical  values  for  alkali  metal  ions  (cores)  are  as  follows  (in  units  of  10""       cm3): 

003(Li+),    0  19<Na+),    089(K+),     1.50(Rb+),    260(Cs+) 
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Since,  according  to  (13.29a), 


3  1~~ 

:2i* ~ 


/(/+0 


relation  (25.39)  may  be  reduced  to 

A£P  =  "~"^i?'  (25.40) 

where 


4ai   /_ 


__ 

1)  (/  +  1)  (/  +  -|)   '  (25.41) 


We  can,  therefore,  find  the  total  energy,  which  in  this  case  depends 
upon  both  /  and  n  (for  the  time  being,  we  are  neglecting  spin 
corrections) 


Substituting  expression  (25.40)  for  A£p  and  using  the  relation 


we  obtain 


since 

1  1    /,         M'2          1      ,    28 


Introducing  the  "effective  principal  quantum  number"  neff 
we  get 
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It  should  be  noted  that  Eq.  (25.41)  cannot  be  used  for  s  states, 
since  the  coefficient  Sj  becomes  infinite  when  /  =  0.  This  happens 
because  polarization  forces  can  be  introduced  meaningfully  only 
in  the  case  where  the  outer  electron  is  at  a  sufficiently  great 
distance  from  the  atomic  core.  For  s  terms  the  wave  function  does 
not  vanish  even  when  r  =  0  [see  (25.34)]. 

The  influence  of  the  inner  electrons  on  the  s  orbits  (penetrating) 
is  due  mainly  to  the  "smearing"  of  the  electronic  cloud  of  the  atomic 
core.  In  general,  the  additional  energy  due  to  the  "smearing"  of 
the  electrons  over  the  volume  of  the  atomic  core  is  given  by 

.  (25.44) 

where  Vvoi  is  the  difference  between  the  potential  energy  produced 
by  the  electrons  of  the  atomic  core,  taking  into  account  their 
distribution  over  a  finite  volume,  and  the  potential  energy  pro- 
duced by  an  equivalent  charge  concentrated  at  the  center. 

To  estimate  the  order  of  magnitude  of  the  correction  S  for  s 
terms,  let  us  assume  that  the  Z—  1  electrons  of  the  inner  shells 
fill  uniformly  a  volume  of  radius  R.  We  then  have 


Replacing  the  wave  function  for  the  s  states  by  its  value  at  the  origin 
[see  (25.34)],  we  find  the  following  approximate  expression  for  the 
additional  energy  of  the  s  terms7: 


.  <««•> 

where 


=  --- 
—  5    a*  ' 

This  expression  no  longer  diverges. 

7As  a  rule,  the  correction  S  for  penetrating  orbits  is  taken  into  account  in  the  following 
manner  It  is  assumed  that  the  orbit  consists  of  two  parts:  an  outer  part  and  an  inner  one 
In  the  outer  part  of  the  orbit  the  electron  is  acted  upon  by  a  point  charge  cJ0Za  (m  a^ali 
metals  Za  =•  1),  in  the  inner  orbit,  the  electron  experiences  the  effect  of  point  charge  CQ^, 
which  should  be  greater  than  eQZa  because  the  screening  effect  of  the  inner  electrons  is 
reduced.  It  is  difficult  to  determine  the  charge  Zit  theoretically,  and  it  is  usually  regarded 
as  an  empirical  parameter. 

We  thus  obtain  the  following  equation  for  & 

s,  -    -z,-—       l  -    +     -  <2S43> 


where  a  is  the  maximum  distance  from  the  nucleus  to  the  electron  in  the  inner  ellipsoid 
(that  is,  the  radius  of  the  atomic  core)  It  is  easily  shown  that  the  quantity  8j  decreases 
rapidly  with  increasing  /.  A  particularly  good  value  of  6^  is  obtained  for  s  terms. 
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We  may  note  here  that  according  to  the  Thomas- Fermi  model, 
the  radius  of  an  atom  [see  (25.28)]  is 

#==^r,  (25.48) 


where  the  coefficient  T  >  which  characterizes  the  distribution  of  the 
charge  inside  the  atom,  is  of  the  order  of  unity. 

Consequently,  for  the  total  energy  of  an  electron  in  the  case  of 
"penetrating"  s  orbits,  we  again  obtain  an  equation  of  the  same 
form  as  (25.42) 


f"eff 


(25.49) 


where  /ieff  =«  — 8f  but  now  8  is  given  by  (25.47). 

In  order  to  analyze  the  difference  between  the  corrections 
&  for  "penetrating"  and  c 'nonpenetrating"  orbits,  we  shall  take  as 
an  example  the  Li  atom. 

In  this  atom  the  p  orbit  (I—  l)isnonpenetrating.  Equation  (25.41) 
gives  the  value  8^0.04  for  the  lowest  state  (n  =  '2).  According 
to  (25.47),  the  corresponding  expression  for  8^  for  penetrating  s 
orbits  must  be  one  order  of  magnitude  greater. 

It  should  be  noted  that  for  increasing  n  and  1=  const,  the 
eccentricity  approaches  unity;  that  is,  the  elliptical  "orbits" 
become  more  and  more  elongated: 


(25.50) 

In  heavy  nuclei,  therefore,  there  will  also  be  penetrating  orbits 
with  /  =  0  as  well  as  those  with  larger  values  of/.  This  is  re- 
flected in  the  corrections  8  for  the  alkali  metal  spectra,  whose 
values  are  given  in  Table  25.2. 


Table  25.2 
Corrections  &  for  the  spectra  of  alkali  metals 8 


z 

Element 

s. 

SP 

8rf 

5, 

1 

H 

0.000 

0,000 

0.000 

0.000 

3 

Li 

0412* 

0.041 

0.002 

0.000 

11 

Na 

1.873* 

0.883  > 

0.010 

0.00  1 

19 

K 

2.230* 

1.776* 

0.1  46* 

0.007 

37 

Rb 

3.195* 

2.711* 

1  233* 

0012 

55 

Cs 

4.131* 

3.649* 

2.448* 

0.022 

The  values  of  S  for  penetrating  orbits  are  indicated  by  an  asterisk 
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It  follows  from  Eq.  (25.49)  and  also  from  the  Table  25.2  that, 
for  a  given  n ,  the  largest  downward  shift  (that  is,  in  the  direction 
of  decreasing  energy)  associated  with  the  "smearing"  of  the  inner 
electrons  over  a  finite  volume  occurs  in  states  with  the  lowest  /. 
In  other  words,  the  largest  shift  occurs  for  s  terms. Q 

The  hydrogen  atom  alone  has  no  penetrating  orbits.  For  the  Li 
atom  (Z  =  3;  that  is,  the  next  element  after  hydrogen  in  the  first 
group  of  the  periodic  table),  the  only  penetrating  orbit  is  the  outer 
s  orbit.  In  the  next  alkali  metal,  namely,  the  Na  atom  (Z=ll),  the 
s  and  p  orbits  are  penetrating. 


E.    FUNDAMENTAL  SERIES 

The   energy   levels   of  the  hydrogen  atom  without  relativistic 
corrections  are  given  by  the  well-known  relation 

~^  =  £.  (25.51) 

From  this  we  can  find  the  values  of  the  spectral  terms 

(IS)  =         =/?, 


(3s)  =  (3p)  =  (3d)  =  -£  =     . 

It  follows  that  the  states  of  hydrogen  are  degenerate  with  respect 
to  both  /  and  m  .  A  schematic  diagram  of  the  energy  levels  in  the 
hydrogen  atom  is  given  in  Fig.  25.4. 

In  the  Li  atom,  the  energy  levels  of  the  K  shell  (n==  1)  are  filled 
(see  Fig.  25.2),  and  therefore  it  is  the  L  shell  which  is  the  outer  one. 
Table  25.2  shows  that  the  K  shell  exerts  the  strongest  influence  on 
s  terms,  and  for  the  corresponding  terms  we  have 


(„  __  i  +  o  588)*" 

This   shift   is   so   large  that  it  was  difficult  to  determine  experi- 
mentally whether  it  belonged  to  n  states  or  n--\  states.  In  order 


In  Chapter  21,  when  the  finite  size  of  the  nucleus  was  taken  into  account,  it  was  also 
found  that  the  largest  shift  occurs  for  s  terms.  In  that  case,  however,  it  was  the  positive 
charge  that  was  spread  over  a  finite  volume,  and  therefore  the  terms  were  shifted  upwards 
and  not  downwards 
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to   make  the  term  notation  resemble  that  of  hydrogen,  spectros- 
copists  have  originally  attributed  the  shift  to  the  n—~  \  state10 


(25.52a) 


where  n*  =  /i—  -  1  and  s=  1  —  S5  =  0.588  .  We  shall  use  an  asterisk 
for  the  original  notation  (n*s)  in  order  to  distinguish  it  from  the 
correct  one  (ns)  . 


H 


Fig.  25-4.  Energy-level  diagram  of  monovalent 
atoms  The  potential  is  usually  measured  (in 
fu)  from  the  lowest  level  upwards  Here, 
however,  we  wish  to  compare  the  energy 
levels  of  different  atoms  and,  therefore,  we 
have  taken  the  potential  at  infinity  as  the 
zero  level 

The  shift  of  the  other  terms  of  lithium  (/=!,  2)  relative  to  the 
corresponding  terms  of  hydrogen  is  negligible 


R 


(«  +  < 


10If  the  principal  quantum  number  n  in  lithium  assumes  the  values  n  -  2,  3,  4  (the  term 
n  —  1  is  occupied  by  two  electrons  and  forms  an  inner  shell),  than  the  quantum  number  n* 
takes  the  values  n*  -  lf  2,  3.  •  .  . 
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where 

p=-ap=-o.04i, 

d  =  —  arf  =  -  0.002. 

The  shell  to  which  a  shift  belonged  could  be,  therefore,  uniquely 
determined  and  in  the  old  notation  the  p,  d,  and  other  terms  were 
placed  in  shells  which  were  later  shown  to  be  theoretically  correct 
(that  is,  n*  =n  ;  see  Fig*  25.4). 

In  the  next  alkali  metal,  Na  (Z=  11),  the  inner  (filled)  shells  are 
the  K  and  L  shells  (see  Fig.  25.2).  As  can  be  seen  from  Table  25.2, 
the  inner  shells  of  Na  have  a  pronounced  influence  on  both  the  s 
terms  and  the  p  terms.  The  original  notation  which  was  used  for 
the  s  terms  was 

n  p 

(ns)  =  (n*s)  =  (/l_li373)i  =  -fi*-+  07627)*  •  (25.53) 


where  n*=n  —  2,  and  s=—  S5  +  2  =  0.627.  Thus,  spectroscopists 
had  originally  assumed  values  of  the  principal  quantum  number  for 
the  s  terms  of  sodium  which  were  low  by  two  units.11 

The  original  notation  that  was  used  for  the  p  terms  was 


(25.54) 

where  n*=n—  1,  p=  1  —8^  =  0.117.  Thus,  the  principal  quantum 
number  for  the  p  terms  was  reduced  by  one  unit.  The  corrections 
for  the  states  dt  f,  .  .  .  were  negligible,  and  these  terms  had  been 
assigned  to  the  shells  that  were  later  obtained  from  the  theory. 
The  energy-level  diagram  of  Na  is  given  in  Fig.  25.4. 

The  spectral  series  of  alkali  metals  are  as  follows. 

1.  The  principal  series.  The  variable  term  is  the  p  (principal) 
term.  The  spectral  frequencies  in  this  series  are  given  by 


which  means 


for   H:(ls)-(/zp), 

for   Li:  (25)  —  (np),  (25.55) 

for   Na  (35)  —  (np). 


11  At  present,  spectroscopists  have  also  adopted  a  notation  which  follows  from  theoreti- 
cal calculations    This  is  the  notation  used  in  Fig.  25  4 
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2.  The  sharp  series.   The  variable  term  is  the  s  (sharp)  term. 
The  spectral  frequencies  are  given  by 

«>  =  (2*p)  -("*«). 
which  means 

for    H:(2/?)  —  (/is), 

for   Li:(2p)  —  (/is),  (25.56) 

for  Na:(3p)  —  (/is). 

3.  The  diffuse  series.  The  variable  term  is  the  d  (diffuse)  term. 
The  spectral  frequencies  are  given  by 

G>  =  (2*p)  —  (n*d).  (25.57) 

4.  The  fundamental  series. 

co  =  (3*d)  —  (/iV).  (25.58) 

The  variable  term  is  the  /  (fundamental)  term. 

These  series  take  into  account  the  selection  rule 


The    names   of  the   series   partly   reflect  the   character  of  their 
multiplet  structure. 


F.    MULTIPLET  STRUCTURE  OF  THE  SPECTRAL  LINES 

Just  as  in  the  hydrogen  atom,  the  multiplet  structure  of  the 
spectral  lines  of  alkali  metals  is  due  to  the  spin  and  relativistic 
effects.  To  find  the  splitting  of  the  terms,  let  us  use  a  relation 
including  both  the  relativistic  and  spin-orbit  corrections  for  a 
hydrogen-like  atom  [see  (20.18)] 

LEntJ_RZ**l      n  3\  (2559) 

*    ~~    ^  \7F7T")'  ^S'Sy> 

where   a.  =  e*Jfic=  1/137  is  the  fine- structure  constant. 

To  account  for  the  effect  of  electrons  from  the  inner  shells  of  al- 
kali metals,  we  simply  replace  Z  by  a  certain  effective  value  Zeff 
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For  "nonpenetrating"  orbits,  we  may  set  Zeff  =  1,  because  all 
the  Z — 1  electrons  will  screen  the  positive  charge  of  the  nucleus. 
For  '  'penetrating' '  orbits  the  best  value  of  Zea  is  chosen  from  a 
comparison  with  experiment. 


du>4      iT 


dWjUI 


2s 


Principal  series,  con-         Sharp  series,  equl- 
verging  doublets  distant  doublets 


Fig    25.5.  Spectra  of  alkali  metals 


Since  the  total  angular  momentum  quantum  number  /  assumes 
the  values 


for  /=o 


and 


for 


we  may  conclude  that  all  spectral  terms  of  alkali  metals  should  be 
doublets,  except  for  the  s  term,  where  there  is  no  splitting. 

In  order  to  find  the  magnitude  of  the  splitting,  let  us  calculate 
the  value  of  the  spectral  terms  for  two  cases?  first,  when  the  spin 
and  the  orbital  angular  momentum  are  parallel 


(25.61) 


and,  second,  when  they  are  antiparallel 

A£yi-Vj_  ^Zeff     In         3\ 

n      "  ~~      /i*      \i      4 )  • 


(25.62) 
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The  splitting  of  the  terms  is  equal  to  the  difference  between  (25.62) 
and  (25.61)  12 


1)'  (25.63) 

We  shall  use  this  equation  to  explain  the  doublet  splitting  of  the 
principal  series,  that  is,  transitions  originating  from  the  p  levels. 
Setting  /  =  1  we  find 

(25.64) 

It  is  evident  now  that  the  decrease  in  splitting  will  be  inversely  pro- 
portional to  the  cube  of  the  principal  quantum  number  n  (see  Fig. 
25.5);  that  is,  the  spectral  of  the  mamseries  are  narrowing  doublets. 

In  the  sharp  series,  the  initial 
level  is  a  doublet  and  the  final  level 
(with  variable  s)  is  a  singlet  (see 
Fig.  25.5).  The  distance  between  the 
doublets  in  the  sharp  series  is  con- 
stant (equidistant  doublets) 


2  =  Aw3  =  .  .  .  =  Act),,  = 


~~~         16 

In  particular,  for  the  spectrum  of 
Li  we  have  Zeff  =  1,  and  n*  =  2  (the 
p  orbit  is  "nonpenetrating"),  so  that 
this  constant  splitting  is  equal  to 

Fig.  25.6.    Diagram    of    the    spectral 

terms    of  the   sodium   atom   including  /^a 

the  fine  structure  A(i)z  =  -yg-  . 

A   diagram  of  the  multiplet  structure  of  the  principal  and  sharp 
series  of  sodium  (Z~  11)    is  given  in  Fig.  25.6. 

The  diffuse  series  does  not  obey  such  an  explicit  law  for  the 
splitting  of  spectral  lines.  Each  line  will  be  split  into  three  rather 
than  four  levels  (see  Fig.  25.7),  since  in  this  case  in  addition  to 
the  selection  rules  A/  =  itl  ,  it  is  also  necessary  to  take  into 


12 
Several  other  formulas  have  also  been  proposed  for  the  doublet  splitting  For  example, 

on  the  basis  of  the  quasi-classical  picture  of  penetrating  orbits,  Lande  proposed  a  formula 
replacing  in  (25  63)  n  by  neff  ~  n  -  S  and  Z*ff  by  Z\  zj ,  where  7,a  is  the  total  charge  of 
the  ion  and  Zt  is  the  effective  charge  of  the  nucleus  in  the  inner  region  [see  Eq.  (25.43)] 
into  which  the  orbit  penetrates.  For  further  details  we  refer  the  reader  to  E.  Condon  and 
G.  Shortley,  Theory  of  Atomic  Spectra,  New  York:  Cambridge  University  Press,1958. 


OPTICAL    SPECTRA    OF    ALKALI     METALS 


419 


account  the  selection  rule  for  the  total  angular  momentum  quantum 
number 


The   spectral   lines   of  the  fundamental  series  are  also  split  into 
three  components. 


y>  "5/2 

n  a  3/2 

* 

Fig.  25.7.     Splitting    of    the    diffuse 
series 

The  multiplet  splitting  of  the  spectral  lines  of  monovalent  atoms 
may  be  explained  only  by  taking  into  account  the  spin  properties  of 
electrons.  We  have  already  and  repeatedly  stated  that  only  the 
half- integral  quantum  numbers  (which  characterize  the  spin)  can 
lead  to  doublet  splitting  of  the  terms  (as  in  the  Stern-Gerlach 
experiments). 


Chapter  26 


Mendeleyev's   Periodic  System  of  Elements 


A.    X-RAY  SPECTRA  OF  ATOMS 

X-ray  spectra  provide  important  information  on  the  structure 
of  the  inner  shells  of  atoms  and  are  therefore  useful  in  studying 
the  sequence  in  which  the  shells  are  filled  by  electrons.  We  recall 
that  x-rays  are  emitted  when  a  beam  of  fast  electrons  strikes  the 
plate  of  a  cathode-ray  tube  (see  Fig.  26.1).  An  analysis  of  the 
emerging  x-rays  shows  that  they  consist  of  two  different  types  of 
spectra— a  continuous  and  a  line  spectrum.  The  continuous  x-ray 
spectrum  arises  as  a  result  of  the  deceleration  of  electrons  when 
they  strike  the  target.  The  continuous  spectrum  is  therefore  a 
type  of  bremsstrahlung.  If  the  deceleration  of  electrons  is  equal 
to  w(w<^0),  then  according  to  classical  electrodynamics,  the  energy 
radiated  by  the  electrons  per  unit  time  is  given  by  the  relation 

~~  N~  7T~cr* 

A  characteristic  feature  of  the  continuous  spectrum  is  that  it 
is  cut  off  at  short  wavelengths.  The  wavelength  A min  at  which  this 
occurs  decreases  as  the  potential  difference  between  the  cathode 
and  the  anode  increases;  the  cutoff  wavelength  Xmin  can  be  deter- 
mined from  the  law  of  conservation  of  energy 

y  *         !     mO& 

where  V==e^l)  is  the  electron  energy  before  colliding  with  the 
target,  <t>  is  the  potential  difference  between  the  anode  and  the 
cathode,  ftu>  is  the  energy  of  the  emitted  bremsstrahlung  photon, 

and  -1-—  is  the  kinetic  energy  of  the  electron  after  colliding  with 
the  target. 

Introducing  the  wavelength  x=s=  —  instead  of  the  frequency  o>, 
we  find  that 

x=         ch 
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From  the  last  equation  it  is  evident  that  X  can  vary  from  /  =  oo  (as 
in  the  case  when  an  electron  does  not  lose  any  energy  during  the 

collision  with  the  target (y  mtfJ*  —  V)  to  a  certain  minimum  value 


X       —  *- 
min —  i/» 


(26.1) 

as  in  the  case  when  an  electron  loses  all  its  energy  in  the  collision 


Fig.  26.1.  Schematic  diagram  of  an  x-ray  tube. 
C  -  cathode,   T  -  target 

The   dependence   of  the   radiation   intensity  on    ^  is  plotted  in 
Fig.  26.2  for  two  values  of  the  energy  of  the  primary  electrons  (25 
and  50   kev).    It  is  clearly  seen  that  an  increase  in  <t>  corresponds 
to   a  decrease   of  the   cutoff  wave- 
length Xmm,in  accordance  with  Eq.  ,     50 kev 
(26.1).  This  equation  was  successfully               w  n 
used  in  the  determination  of  a  more              | 
accurate  value  of  Planck's  constant               j>    9 
h  than  was  given  by  the  Wien  and  the               g 
Stefan-Boltzmann  laws  (see  Chapter              U   6 

!)•  £ 

When  the  energy  of  the  electrons  g   j 

incident  on  the  target  exceeds  a  cer-  flj 

tain  critical  value,  determined  by  the  ~  Q 

target    material,    a   line    spectrum,  l 

which  is  known  as  the  characteristic 
spectrum,  is  superposed  on  the  con- 
tinuous spectrum.  The  line  spectrum 
characterizes  material  of  which  the 
target  is  made  (or  rather,  the  struc- 
ture of  the  inner  shells  of  the  target  material)  in  the  same  way  as 
the  optical  spectrum  characterizes  the  structure  of  the  outer  shells 
of  the  atom.    For  example,  in  the  case  of  a  rhodium  target,  a  line 


0*     06      08 
Wavelength,  A 


f,Q 


Fig.  26.2.  Short  wavelength  lim- 
its    of     the     continuous     x-ray 
spectrum. 
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spectrum  begins  to  be  superposed  on  the  continuous  spectrum  at 
an  energy  of  31.8  kev. 

The  properties  of  the  line  spectrum  are  identical  for  all  chem- 
ical compounds  of  a  given  element.  The  characteristic  x-ray 
spectrum  and  the  optical  spectrum  are  different  in  this  respect, 
since  the  latter  depends  on  whether  the  substance  occurs  in  atomic 
or  molecular  state  (for  example,  the  optical  spectra  of  atomic 
oxygen,  the  Og  molecule,  and  the  HsO  molecule  are  completely 
different).  This  can  be  readily  understood  since  only  electrons  of 
the  outer  orbits  participate  in  chemical  bonding. 

The  spectral  lines  of  the  characteristic  x-ray  spectrum  form 
regular  sequences  or  series,  just  like  the  optical  lines  of  atoms. 
These  series  are  designated  by  the  capital  letters  K9  L,  M,  N,  .  .  .  ; 
the  K  series  has  the  shortest  wavelength,  followed  by  the  L  series, 
and  so  on. 


B.  CHARACTERISTIC  SPECTRA  OF  ATOMS  AND  THE 
STRUCTURE  OF  THEIR  INNER  SHELLS 

The  mechanism  which  is  responsible  for  the  characteristic 
x-ray  spectrum  of  an  element  was  first  explained  by  Kossel  (1914). 
Let  us  suppose  that  an  electron  which  is  incident  on  a  target  re- 
moves an  electron  from,  for  example,  the  K  shell  of  the  target 
atom,  and  thus  leaves  a  vacant  site  in  the  /(  shell  (see  Fig.  26.3). 

An  electron  may  jump  from  the  L, 
M,  A/,  ...  shells  to  this  vacant  site, 
giving  rise  to  x-ray  lines  (denoted 
by  /<a,  Kp,  Kr  >...)•  The  character- 
istic x-ray  spectrum  is  thus  formed 
as  a  result  of  transitions  of  electrons 
from  one  inner  shell  to  another.  Since 
the  bonding  energy  of  electrons  mov- 
ing in  the  inner  orbits  is  much 
greater  than  that  of  the  outer  elec- 
trons. Electrons  of  much  greater 
F,fr  26.3.  schemata  diagram  of  e™rW  (several  tens  of  kev)  are  re- 

the   origm   of  a  characteristic          quired  for  production  of  the  charac- 
x-ray  spectrum  according  to  KOS-  teristic  x-ray  spectrum  than  for  the 

sei:  •-electrons.  The  dashed  hne          excitation  of  optical  spectra  (where 
represents   the  ejection  of  an  several   tens   of  ev   are  sufficient). 

electron  from  the  K  shell  methodg    ^    can  be  uged  lft 


constructing   a   theory   of  the  com- 

plex atom,  while  taking  into  account  the  interaction  of  atomic 
electrons.  In  the  first  method  the  main  potential  is  taken  to  be 
the  potential  of  the  nucleus  when  it  is  completely  screened  by 
the  inner  electrons.  We  used  this  method  in  constructing  the  theory 
of  the  optical  spectra  of  alkali  metals.  The  potential  was  determined 
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by  the  nuclear  charge  (Ze0)  and  the  charge  of  the  electrons  in  the 
inner  orbitals  [—  (Z  —  I)e0  ].  The  total  potential  was  equal  to 

$  =  ?i(Z  —  (Z—  !))  =  £.  (26.2) 

As  the  perturbation  potential  we  selected  an  additional  potential 
which  took  into  account  the  polarization  and  the  space  distribution 
of  the  electron  cloud.  This  method  is  particularly  suitable  for  outer 
electrons,  as,  for  example,  in  atoms  of  alkali  metals. 

In  studying  the  motion  of  electrons  in  the  inner  shells,  it  is  con- 
venient to  use  the  potential  of  the  nucleus 

0  =  ^  (26.3) 

as  the  main  term  in  the  expression  for  the  potential,  and  to  regard 
the  additional  potential  produced  by  the  electron  shell  as  a  correc- 
tion. In  this  case,  the  presence  of  electron  shells  will  result  in  a 
screening  (an  effective  decrease)  of  the  nuclear  charge  ZeQ  by  the 
amount  Srte0,  and  the  total  potential  will  be 

0  =  (z-st)*Q.  (26.4) 


For  example,  in  the  investigation  of  helium-like  atoms,  it  was 
shown  that  the  interaction  of  the  electrons  in  the  /<  shell  reduces 
the  effective  nuclear  charge  [see  (25.35)],  which  then  becomes 

Z'  ->  Z  -—  ~  .  Thus  the  quantity  sn  is  equal  to  tt/16  in  this  case. 

The  screening  constant  Sn  must  be  a  function  of  both  n  and  /  . 
It  becomes  larger  as  n  increases  since  there  is  a  greater  number 
of  electrons  screening  the  nucleus.  It  also  becomes  larger  (but 
more  slowly)  as  /  increases,  since  the  orbits  become  less  and  less 
penetrating  and  the  effective  charge  decreases  somewhat  on  the 
average.  In  the  first  approximation  it  may  be  assumed  that  the 
screening  constant  is  independent  of  /. 

The  potential  (26.4)  gives  the  same  formula  for  the  spectral 
terms  as  was  obtained  for  a  hydrogen-like  atom,  except  that  the 
quantity  Z  is  replaced  by  Z  —  Sn: 

E.  =  -£=3p™.  (26.5) 

From  (26.5)  we  can  obtain  an  expression  for  the  frequency  of  the 
K,  line 


(26.6) 
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It  follows  that  the  frequency  of  the  x-ray  spectral  lines  increases 
monotonically  as  a  function  of  the  atomic  number  Z.  This  was 
first  deduced  by  Mosley  (1914),  from  an  analysis  of  empirical  data; 
he  wrote  this  relationship  in  a  somewhat  different  form 


(26.7) 


This  formula  can  be  obtained  from  (26.6)  if  it  is  assumed  that  the 
screening  constant  for  K,  and  L  shells  is  the  same,  i.e.,  Sl  =  S1  =  S. 
We  know,  however,  that  this  is  not  quite  correct,  and,  therefore,  in 
studying  x-ray  spectra,  just  as  for  optical  spectra,  we  should 
express  the  frequencies  as  differences  of  spectral  terms.  In 
accordance  with  (26.5),  the  spectral  terms  may  be  represented  in 
the  form 


(26.8) 


This    relation   is  called  Moseley's  law;  it  is  usually  analyzed 
graphically.    Ascribing  different  values  to  the  principal  quantum 

number  n,  we  get  (see  Fig.  26.4): 
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Fig.  26.4.  Moseley  diagran 


Investigations    of    the   experimental 
curves    I/  -—^/(Z)   have   given  the 

following     average    values    for    the 

screening  constants:  Sj^l,  5,  — 3.5,  S3=10.5.1  It  has  also  been 
established  the  x-ray  spectra  change  monotonically  with  increas- 
ing Z  and  that  no  periodic  regularities  are  observed.  This  repre- 
sents a  further  difference  between  x-ray  spectra  and  optical  spectra, 
in  which  periodicity  exists  (see  Section  C). 


lSee    A.    Somnu'rfeld,    Atombau    nnd    Spcctrallinien    lr    Vieweg.,    Braunschweig,    1951, 
Chapters   4   and  5. 
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Thus,  periodic  properties  appear  only  in  valence  electrons,  and 
not  in  inner  electrons.  The  study  of  x-ray  spectra  made  it  possible 
to  prove  definitely  that  the  atomic  number  Z,  introduced  by 
Mendeleyev,  is  determined  by  the  charge  of  the  nucleus,  and  not  by 
its  mass.  It  turned  out  that  Mendeleyev  had  correctly  arranged  the 
elements  Co— Ni,  Ar— K,  Te— I  in  a  sequence  which  was  not  the 
same  as  the  order  in  which  their  atomic  weight  increased. 

There  were  also  doubts  concerning  the  correct  sequence  of  the 
rare  earths  (elements  from  Z  =  58  to  71),  whose  chemical  properties 
are  very  similar.  Moseley's  law  made  it  possible  to  verify 
their  arrangement  in  the  periodic  system.  In  addition,  the  study 
of  x-ray  spectra  made  it  possible  to  determine  the  filling  of 
the  inner  shells  of  the  ferromagnetic  metals  and  lanthanides ,  where 
the  Moseley  curve  is  discontinuous. 


C.    MULTIPLET  STRUCTURE  OF  X-RAY  SPECTRA 

X-ray  spectral  terms  are  primarily  determined  by  the  quantum 
state  characterized  by  quantum  numbers  n9  t  and  /,  from  which  the 
electron  has  been  removed,  leaving  a  "hole." 

As  a  rule,  7.5  coupling  occurs  for  outer  electrons,  whereas  // 
coupling  occurs  for  inner  electrons  (in  sufficiently  heavy  atoms). 
In  the  heaviest  atoms  ;/  coupling  begins  to  play  an  important  role 
for  outer  electrons  as  well. 

Since  the  inner  electrons  are  relatively  close  to  the  nucleus, 
they  are  mainly  under  the  influence  of  the  field  of  the  nucleus ,  and 
therefore,  their  energy  states  are  close  to  those  of  a  hydrogen-like 
atom.  We  may,  therefore,  take  the  following  formula  as  a  starting 
point: 


y  +  T 


where  the  fine  structure  constant  is 


In  order  to  account  for  the  screening  of  neighboring  electrons, 
we  must  make  the  substitution  Z-+Z  —  Snlin  (26.10).  We  assume 
in  this  approximation  that  the  screening  constant  depends  not  only 
on  n,  but  also  on  /.  Then  for  the  x-ray  spectra  we  obtain 


J!  __  1 
.i        4 

+2 
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Taking  the  approximate  square  root  we  find  a  generalization  of 
Moseley's  law  to  the  case  that  includes  relativistic  and  spin 
effects 


+    " 


This  formula  shows  that  in  Moseley's  curves  there  appears  a 
term  ~  ZA  in  addition  to  the  term  which  is  proportional  to  Z  .  The 
influence  of  this  additional  term  becomes  marked  only  at  large 
values  of  Z.  This  conclusion  is  in  agreement  with  experimental 
data. 

In  addition,  Eq.  (26.11)  explains  the  multiplet  structure  of  x-ray 
terms.  We  note,  first  of  all,  that  there  is  no  splitting  of  the  K 
term,  since  only  one  state  (lsi/a)  is  possible  (/i=l,  /  =  0,  y  —  */*)• 

For  L  terms,  we  have  three  components:   Li~2si/z(n  =  2,    /  =  0, 


/='/*). 

To    make    Eq.    (26.11)    agree    with   the   experimental   data  for 

screening  constants,  we  must  set 


We  obtain  the  following  equations  for  the  corresponding  terms: 


R 


-i/7'2"/,  _ 

V  ~^r~ 


'   —  i*      i        *     /  *7         o  \  *i    "    /  r       A  v 

— o r  T«-  (Z  —  3)  --r  (LI  terms) : 

^/  1O  4 

"T^ + iV (Z "~ 4)3  T  (Ln  terms) ;        (26- 12) 

^ 4|  j 

— is r  TK  (Z  —  4)9  -r-  (Lni  terms) . 

2        '    10  x  '     4  v  ' 


These  relations  are  represented  graphically  in  Fig.  26.5. 

In   exactly  the   same  way,  it  is  easily  shown  that  the  M  terms 
contain  five  components 


,  Af,v(3dVl), 

where    the    screening    constants   are   S;iJ  =  8,5,  5,^  =  10,  Sarf  =  13. 
There  are  seven  components  for  the  N  terms. 

The  parallel  doublet  L\  and  L\\  [see  (26.12)],  which  is  due  to 
the  screening  of  the  nucleus  by  the  electrons  is  known  as  an  irreg- 
ular doublet,  whereas  the  diverging  doublet  L\  and  L\\\  is  known 
as  a  regular  doublet.  The  reasons  for  the  adoption  of  this  terminology 
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go  back  to  the  first  stage  of  the  theory  of  multiplet  splitting  of 
x-ray  spectra.  It  may  be  recalled  that  a  theory  of  the  fine  structure 
was  first  constructed  by  Sommerfeld,  starting  with  a  relativistic 
generalization  of  the  Bohr  theory.  Sommerfeld' s  formula  gave  a 
correct  value  for  the  splitting  of  the  spectral  terms  (see  Chapter  2), 
but  neglected  the  spin  properties.  For  x-ray  spectra  this  formula  is 


where  n^  =  I  -f-  1 .  Applying  this  equa- 
tion to  the  analysis  of  the  L  terms, 
it  can  be  shown  that  these  terms  split 
into  only  two  components  which  corre- 
spond to  the  diverging  doublet  L\  and 
L\\\  in  our  notation.  Thus,  only  this 
doublet  was  explained  in  Sommer- 
feld's  relativistic  theory.  Accord- 
ingly, it  was  called  the  regular 
doublet.  The  doublet  L\  and  L\\ ,  for 
which  no  theoretical  explanation  was 
given  for  a  long  time,  was  called  the 
irregular  doublet.  With  the  advent  of 
the  Dirac  theory,  the  irregular  doublet 
also  found  an  explanation.  Therefore, 
the  reasons  for  the  adoption  of  this 
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Fig.  26.5.  Multiplet  structure  of 
the  x-ray  L  terms 


terminology  are  entirely  historical  (just  as  for  the  " normal5'  and 
"anomalous"  Zeeman  effect).  The  correct  theory  of  the  multiplet 
structure  of  x-ray  spectra  (and  of  the  anomalous  Zeeman  effect) 
was  developed  much  later,  when  the  spin  properties  of  electrons 
were  taken  into  account. 


D.    THE  DISCOVERY  OF  MENDELEYEV'S  PERIODIC  LAW 

Upon  arranging  the  known  elements  in  order  of  increasing 
atomic  weight,  Mendeleyev  discovered  that  various  chemical 
properties  tend  to  recur  quasi-periodically.  For  example,  the 
chemical  properties  of  sodium,  potassium  and  other  alkali  metals 
are  similar  to  those  of  lithium,  and  the  chemical  properties  of 
chlorine,  bromine,  iodine  and  so  on,  are  similar  to  those  of 
fluorine. 

Mendeleyev  ascribed  to  each  element  a  number  Z  (the  atomic 
number)  giving  its  position  in  the  periodic  system.  Although  the 
increase  of  Z  is  for  the  most  part  parallel  to  the  increase  of  the 
mass  number  of  the  elements,  there  are  several  exceptions— for 
instance,  (27CO— 28Ni),  (18Ar— 19K),  UTe— B3I)—  where  the  element 
with  the  smaller  atomic  number  has  a  larger  atomic  weight. 
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Moreover,  as  we  now  know,  there  exists  a  large  number  of  isotopes, 
that  is,  atoms  having  the  same  Z,  but  different  masses  (for  ex- 
ample, 1H',  ,D\  J')- 

The  periodic  law  has  acquired  particular  significance  in  con- 
nection with  recent  discoveries  concerning  atomic  and  nuclear 
structure.  In  particular,  the  study  of  x-ray  spectra  and  experi- 
ments on  the  scattering  of  a  particles  by  atoms  have  established 
that  the  atomic  number  Z  characterizes  the  charge  of  the  nucleus 
(and  the  number  of  electrons  in  the  neutral  atom). 

Sixty-three  elements  were  known  when  Mendeleyev  discovered 
the  periodic  law  (1869).  He  predicted  the  existence  of  ten  more 
elements,  and  even  described  the  basic  chemical  and  physical 
properties  of  three  elements  which  were  subsequently  discovered, 
namely,  scandium  (>i$c),  gallium  dGa)  and  germanium  (l)2Ge).  The 
inert  (noble)  gases  were  discovered  at  the  end  of  the  19th  century. 
In  Mendeleyev's  time,  only  three  elements  from  the  rare  earth 
group  (lanthanides)  were  known:  cerium,  didymium  (a  mixture  of 
praseodymium  and  neodymium)  and  erbium.  At  the  present  time 
the  properties  of  all  fourteen  rare-earth  elements  have  been 
investigated. 

By  1937,  ninety-two  elements  were  known,  but  four  of  these  had 
not  yet  been  observed.  It  was  later  found  that  these  four  elements 
were  radioactive  and  virtually  nonexistent  in  nature.  They  were 
produced  in  the  laboratory  as  a  result  of  nuclear  reactions. 

In  1937,  E.  Segre  produced  an  element  with  Z^=43,  called  tech- 
netium,  by  neutron  bombardment  of  molybdenum.  The  half-life  of 
its  most  stable  isotope,  «3Tc<J7 ,  was  found  to  equal  2.6  x  10fl  years. 

In  1938,  it  was  first  reported  that  an  isotope  of  the  last  rare- 
earth  element  with  Z  =  6l  had  been  produced  as  a  result  of  deuteron 
bombardment  of  neodymium.  The  half-life  of  the  most  stable 
isotope  of  this  element,  GiPm!" ,  is  about  20  years. 

In  1940,  E.  Segre  discovered  an  element  with  Z-^85  by  irradi- 
ating bismuth  with  a  particles.  He  called  this  element  astatine.  The 
half-life  of  its  most  stable  isotope,  ssAt*10,  is  8.3  hours. 

A  short-lived  element  with  Z~<S7,  called  francium,  was  dis- 
covered in  1939  by  a  Frenchwoman,  Mile.  Perey.  This  element  is 
produced  in  the  «  decay  of  8.jAc^7.  The  half-life  of  its  most  stable 
isotope,  «7Fr2Wf  is  equal  to  22  minutes. 

And  finally,  we  must  mention  that  with  the  development  of 
nuclear  physics,  it  has  become  possible  to  produce  transuranium 
elements.  These  range  from  neptunium  (Z=^93)  to  lawrencium 
(Z  —  103).2  Thus,  the  periodic  system  now  consists  of  103  elements 
without  any  intervening  gaps. 


2 
The  largest  number  of  transuranium  elements  has  been  discovered  by  G    Seaborg  and 

his    students      The   discovery   of  lawrencium  (Lw)  with   Z  —  103   was   recently   announced. 
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E.    FILLING  OF  THE  ELECTRON  SHELLS 

In  quantum  mechanics,  the  levels  of  the  electron  shells  are 
filled  in  accordance  with  the  following  rules. 

(a)  There   can  be  at  most  one  electron  in  each  quantum  state 
(Pauli's  exclusion  principle),  and  therefore  the  maximum  number  of 
electrons   with  a   given   /   is   equal   to  2(21+1)  (see  Chapter  25). 
Thus  the    s,  p,    d  and  /  subshells  can  contain  no  more  than  2,  6,  10 
and  14  electrons,  respectively. 

(b)  Electrons  tend  to  occupy  the  lowest  energy  levels.  There- 
fore, the  shells  withn  =  1  will  be  filled  first,  then  those  with  n  — 2, 
H  =  3,and  so  on. 

The  shells  would  be  filled  in  this  way  in  an  ideal  scheme  in 
which  the  wave  function  of  an  atomic  electron  could  be  calculated 
on  the  basis  of  the  assumption  that  the  charge  of  the  nucleus  and 
the  charges  of  the  Z  —  f  electrons  are  all  located  at  the  center.  In 
this  case  the  energy  levels  of  the  remaining  electron  would  be  the 
same  as  in  the  hydrogen  atom,  and  therefore  they  would  be  degen- 
erate with  respect  to  /.  As  was  shown  in  the  investigation  of  alkali 
metals,  however,  the  distribution  of  the  electrons  over  a  finite 
volume  removes  the  /  degeneracy,  so  that  terms  with  a  fixed  value 
of  the  principal  quantum  number  n  (that  is,  the  terms  in  a  specific 
shell)  are  arranged  in  order  of  increasing  /.  The  s  term  is  there- 
fore filled  first,  then  the  p  term,  and  finally,  the  d  term. 

Moreover,  the  4s  subshell  is  located  below  the  3d  subshell  (and 
5s  is  below  4rf),  while  the  6s  subshell  is  below  both  the  5c/  subshell 
and  the  4/  subshell  (similarly,  7s  is  below  5/).  It  turns  out  that  the 
outer  shell  (in  an  unexcited  atom)  can  consist  only  of  s  and  /;  sub- 
shells.  The  d  and  /  subshells  can  be  filled  when  they  lie  in  the 
first  or  second  inner  shell,  respectively  (the  first  inner  shell  is 
taken  to  be  the  shell  directly  adjacent  to  the  outer  shell)?  We 
shall  make  an  attempt  to  substantiate  this  by  an  investigation  of 
the  ground-state  configuration  of  the  electrons  in  individual  atoms 
(see  Fig.  26.6). 


The  order  in  which  the  electron  subshells  are  formed  can  be  remembered  most  simply 
with  the  help  of  the  following  empirical  rule:  the  levels  are  filled  in  the  order  in  which  the 
sum  of  the  principal  and  orbital  quantum  numbers,  n  \  /,  increases,  and  levels  with  the 
same  value  of  this  sum  are  filled  in  order  of  increasing  n  Since  I  takes  the  values 
0,  1,  2,  ,  n  —  1,  we  can  find  the  rule  for  filling  the  terms  in  any  shell  For  example,  the 
fourth  period  (see  below)  will  be  filled  in  the  order 

4s  (rat/- 4),       3d(nt/=5),      4p(n  +  /      5) 
and  the  sixth  period  in  the  order 

6s  (n  f  /  =  6),       4f(n  +  / --  7), 
Sd(n  +  l-=7),       6p(rc  +  I  =  7). 
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Within  the  first  and  second  periods  of  Mendeleyev's  system, 
the  order  in  which  the  levels  are  filled  conforms  to  the  sequence 
of  levels  in  the  hydrogen  atom  (the  ideal  scheme).  If  this  ideal 
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Fig.  26.6.  Diagram  of  the  filling  of  energy 
levels  in  the  periodic  system  of  elements. 
Only  s  and  p  subshells  can  be  present  in  the 
outer  shell  of  an  atom.  A  d  subshell  can  be 
filled  only  beginning  with  the  first  inner 
shell.  An  f  subshell  can  be  filled  only  be- 
ginning with  the  second  inner  shell.  The 
filling  of  the  3d  subshell  gives  the  ferro- 
magnetic elements  (Fe,  Co,  Ni).  The  fil- 
ling of  the  4/"  subshell  gives  the  lantha- 
nides  or  rare  earths  (58Ce  —  7jLu).  The 
filling  of  the  5f  subshell  gives  the  actinides 
(90Th  -  103).  The  asterisks  denote  the  atom- 
ic numbers  of  elements  which  have  not  yet 
been  discovered.  The  maximum  number  of 
electrons  in  a  given  shell  or  subshell  is  in- 
dicated in  parentheses.  Emanation  (Em)  (the 
element  with  Z  —  86)  is  also  called  radon  (Rn). 


•1®}K(2) 


scheme  were  applicable  to  complex  atoms,  the  3d  subshell  would 
start  to  be  filled  beginning  with  potassium  (Z=19).  According  to 
the  table  given  in  Chapter  25,  however>  5^  =  0,140,  and  8S  =  2,23,  for 
potassium,  and  therefore  the  energy  of  the  electrons  in  the  3d  and 
4s  states  will  be 


(3  —  0.146)* 


2,854*' ' 
T.77r' 


(26.13) 


It  can  be  seen  that  E9d^>Eis,  and  hence  the  4s  level  will  be  filled 
before  the  3d  level.  Consequently,  the  third  period  will  contain  only 
seven  elements  (nNa  —  i8Ar),  just  like  the  second  period. 


MENDELEYEV'S    PERIODIC    SYSTEM    OF    ELEMENTS  431 

After  the  4s  subsheli  is  filled  in  Ca  (Z  =  20),  one  might  expect 
that  the  filling  of  the  subsheli  would  begin  with  scandium  (Z  =  21). 
Spectroscopic  data  show,  however,  that  in  elements  9iSc  —  28Ni  the 
3d  subsheli  is  filled  first.  This  subsheli  becomes  filled  at  the  ex- 
pense of  electrons  from  2nCu  to  30Zn  of  the  4s  subsheli,  which  then 
must  be  refilled  after  all  available  states  in  the  3d  subsheli  are  oc- 
cupied. Only  after  that  can  the  refilling  of  the  4p  subsheli  start. 
Thus  the  fourth  period  contains  18  elements  and  consists  of  the  4s, 
3d  and  4p  subshells  (see  Fig.  26.6).  The  fifth  period  repeats  the 
fourth  period  (the  5s,  4d,  5p  subshells  are  filled),  and  thus  it  also 
contains  18  elements  GvRb  — 84Xe). 

The  sixth  period  contains  32  elements  (89Cs  — 86Rn),  because 
besides  the  outer  shell,  consisting  of  6s  and  6p  states,  the  first 
inner  subsheli  (ten  electrons)  and  the  second  inner  subsheli  4/  (the 
14  electrons  of  the  lanthanide  or  rare- earth  elements)  will  be  filled. 

In  exactly  the  same  way  the  seventh  period  should  repeat  the 
sixth  period;  that  is,  it  should  contain  32  elements  (the  7s,  5/,  6d, 
Ip  subshells).  So  far,  however,  only  17  elements  of  this  period 
have  been  discovered.  The  so-called  actinides,  in  which  the  second 
inner  subsheli  5/  is  filled  (90TH—  element  103)  should  have  properties 
similar  to  those  of  the  lanthanides. 

The  first  period  therefore  contains  two  elements,  the  second  and 
third  periods  eight  elements  each,  the  fourth  and  fifth  periods  18  ele- 
ments each,  and  the  sixth  and  seventh  period  32  elements  each 
(except  that  the  seventh  period  is  incomplete).  The  order  in  which 
the  states  are  filled  is  illustrated  in  Fig.  26.6. 


F.    APPLICATION  OF  THE  THOMAS- FERMI  METHOD  TO  THE 
THEORY  OF  THE  PERIODIC  SYSTEM  OF  ELEMENTS 

We  shall  now  attempt  to  treat  the  ground-state  configuration  of  the  elements  more 
rigorously. 

In  a  paper  devoted  to  the  statistical  theory  of  the  atom  ( 1928),  Fermi  proposed  a 
method,  now  known  as  the  Thomas-Fermi  method  (see  Chapter  25),  to  explain  the  periodic 
system  of  elements.  With  this  method,  he  obtained  the  minimum  values  of  Z  for  which 
s,  p,  d  and  /  states  are  filled  in  atoms.  He  obtained  these  values  by  starting  from  the 
following  quasi-classical  ideas. 

In  classical  theory  the  angular  momentum  of  a  particle  L  is  related  to  the  momentum 
p  by  the  expression 

L  =  rx  p. 

Consequently, 

rt^Ts. 

where  pn  is  the  component  of  momentum  perpendicular  to  the  position  vector  r. 

Obviously,  the  square  p^  of  this  component  of  momentum  cannot  be  greater  than  the 
square  of  the  maximum  momentum,  which  we  shall  denote  by  P.  Therefore  for  a  given  P 
and  r  the  possible  values  of  the  angular  momentum  L  must  satisfy  the  inequality 

/*>-.  (26.14) 
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It  was   shown  in  Chapter   13  that,  in  the  quasi-classical  treatment  of  the  atom,  the 
square  of  the  angular  momentum  must  be  [see  (13.62)] 

(26.15) 

This  relation  represents  a  compromise  between  the  Bohr  relation  I*  —  W  (/  +  I  )a  and 
the  quantum  mechanical  relation  L*  =  W[  (I  +  1)  .  b 

The  maximum  momentum  P  is  related  to  the  density  of  the  electron  gas  (electrons  in 
the  atom)  p0by  the  expression  (6.32) 

/»=a*(3*»p0)a/».  (26.16) 

The  electron  density  po  may  be  found  from  the  Thorn  as-  Fermi  equation  (see  Chapter  25), 
which,  as  we  have  already  indicated,  can  be  solved  only  by  approximate  or  numerical 
methods.  A  good  approximation  for  p0,  as  follows  from  a  solution  of  the  Thomas-  Fermi 
equation,  is  provided  by  the  expression  [see  (25.21)] 


where  the  coefficient  I  is  found  bv  the  Ritz  variational  method. 

Substituting  these  values  for  P*and  L*  into  the  inequality  (26.14),  we  get 


Introducing  the  new  variable  hr  =  x,  we  find 


(26.18) 


£_  (26.19) 

x  ' 


where 

''- 

From  the  inequality  (26.19)  it  is  evident  that  the  right-hand  side  of  (26.19)  becomes 
greater  than  the  left-hand  side  as  A:  —  0  (r  —  0)  and  as  x  —  •>  GO.  The  electrons  in  the  atom 
therefore  can  have  a  given  value  of  /  if  x  lies  in  the  range  x\  <.x<.  xa  for  which  the 
inequality  (26.  19)  is  satisfied.  Here  Xi  and  xa  are  roots  of  the  equation 

e~  I'^^O  (26.21) 

x  ' 

The  condition  for  the  appearance  of  states  with  a  given  value  of  /  is  the  equality  of 
both  roots 


In  this  case  we  should  equate  not  only  the  two  functions  themselves,  but  also  their  de- 
rivatives. Then,  in  addition  to  Eq.  (26.21),  we  will  have 


'        * 


(26.22) 


MENDELEYEV'S    PERIODIC    SYSTEM    OF    ELEMENTS 
These  two  relations  will  be  satisfied  for 
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that  is,  when 


Substituting  the  value  for  D  from  (26.20),  we  find  the  value  of  Z  at  which  electrons  with  a 
given  /  will  first  appear 


z  =  -~r(2/-hD3  =  -r(2/+i)8, 


(26.23) 


where  e  =  2.718 ...  is  the  base  of  the  natural  logarithms  and  the  coefficient  7  is  equal  to 
0.158. 

A  numerical  solution  of  the  Thomas-Fermi  equation  gives  a  very  similar  value  for  7 


This  again  is  a  convincing  demonstration  that  the  density  (26,17)  represents  a  good 
approximation  to  the  density  which  is  given  by  a  numerical  solution  of  the  Thomas- 
Fermi  equation. 

Equation  (26.23)  enables  us  to  calculate  the  Z  values  at  which  the  s,  p9  dm  and  /states 
begin  to  be  filled.  The  results  of  this  calculation  are  given  in  Table  26. 1.  The  first  row 
gives  fractional  values  of  Z  computed  from  formula  (26.23)  with  7T-F=S:0«155.  The 
values  of  Z,  calculated  with  7  T_F  =  0.158,  for  which  s,/?,  d,  and  /states  first  appear 
are  practically  identical  with  those  calculated  with  7  1_p  =  0.155.  The  second  row  gives 
the  nearest  greater  integral  value  of  Z.  The  last  row  of  the  table  gives  the  empirical 
values  of  Z  at  which  the  states  first  appear,  and  also  the  symbol  of  the  corresponding 
elements. 

Table  26.1 
Atomic  numbers  at  which  the  s,p,d,  and  /states  first  appear  for  a  given  / 


s 

p 

d 

/ 

0 

1 

2 

3 

Theoretical  value 

j  0.15 

4.2 

19.4 

53.2 

(Thomas-Fermi) 
Empirical  value 

Z  1  (H) 

5 
5(B) 

20 
21  (Sc) 

54 
58  (Ce) 

From  this  table  it  can  be  seen  that  this  approximate  theory  is  in  good  agreement 
with  the  experimental  data.  We  may  note  that  complete  agreement  is  obtained  if  the  co- 
efficient 7  is  taken  to  be  0. 169  instead  of  0. 155. 

It  is  well  known  that  in  light  elements  (Z  —  1,  2,  3,  4)  only  s  states  are  filled.  The 
filling  of  the  p  states  begins  with  boron  (Z  =  5);  this  is  in  complete  agreement  with 
theoretical  data.  Table  26.1  shows  (in  spite  of  the  crudeness  of  the  statistical  model) 
that  the  filling  of  the  3d  subshell  does  not  begin,  as  might  be  expected,  in  potassium 
(Z  =  19),  but  in  scandium  (Z  =  21);  that  is  to  say,  it  does  not  begin  until  the  4s  subshell 
is  completely  filled.  Similarly,  the  Thomas-Fermi  model  explains  the  "delay"  in  the 
filling  of  the  4/  subshell,  which  might  be  expected  to  begin  in  Ag  (Z  =  47).  According  to 
the  theory,  however,  the  filling  of  the  4/  subshell  should  be  shifted,  and  should  begin  only 
in  cerium  (Z  =  58).  It  follows  from  (26.3)  that  the  filling  of  the  5g  subshell  (/  =  4)  would 
begin  in  the  element  with  Z  =  1 24 

The  Thorn  as- Fermi  model  accounts  for  a  very  important  feature  of  the  ground-state 
configuration  of  atoms  and  explains  the  departure  of  the  filling  of  the  levels  from  the 
ideal  scheme  (the  hydrogen  scheme)  in  terms  of  the  ''smearing"  of  the  electron  cloud. 
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G.   PERIODICITY  IN  THE  PROPERTIES 
OF  ELEMENTS 

The  periodicity  in  the  properties  of  the  elements  can  be  ex- 
plained quite  naturally  in  quantum  mechanics.  It  is  connected  with 
the  periodic  nature  of  the  filling  of  the  outer  shell,  which  contains 
at  most  eight  electrons  ($  and  p  states)  and  which  determines  the 
chemical  and  optical  properties  of  atoms.  All  elements,  therefore, 
can  be  divided  into  eight  groups,  depending  on  the  number  of  elec- 
trons in  the  outer  shell. 

The  elements  in  Group  I  (hydrogen  and  alkali  metals)  have  an 
outer  shell  containing  a  single  electron.  As  a  result,  the  optical 
terms  (except  the  s  term)  are  doublets  and  the  elements  are 
monovalent,  as  will  be  shown  below.  The  elements  of  Group  II— the 
alkaline  earth  metals  (beryllium,  magnesium,  calcium,  etc.)-have 
two  valence  electrons;  their  spectral  terms  must  therefore  be 
singlets  and  triplets,  and  the  valence  is  equal  to  two.  The  elements 
of  Group  III  have  an  outer  shell  containing  three  electrons,  and 
therefore  the  maximum  splitting  of  their  optical  terms  must  be 
equal  to  four  (quartets);  their  maximum  valence  is  three. 

On  the  contrary,  the  elements  of  Group  VII— the  halogens 
(fluorine,  chlorine,  etc.)— lack  just  one  electron  to  fill  the  outer 
shell  completely.  Therefore,  in  addition  to  the  maximum  (positive) 
valence  of  seven,  they  may  have  a  negative  valence  of  —  1  (the 
number  of  electrons  required  to  obtain  a  stable  configuration). 
They  exhibit  this  valence  in  the  so-called  ionic  compounds  (see 
Chapter  27). 

Finally,  in  the  inert  gas  group  (neon,  argon,  krypton,  etc.)  the 
outer  shell  is  completely  filled.  We  may  even  say  that  there  is  no 
outer  shell  in  these  elements,  because  the  energy  bonding  these 
electrons  in  the  outer  shell  is  larger  than  that  bonding  all  other 
shells  in  the  molecule  and  thus  it  would  be  more  correct  to  ascribe 
it  to  the  inner  shell.  Thus,  these  elements  may  be  assigned  to  the 
"zeroth"  group.  The  elements  of  the  zeroth  group  do  not  as  a 
rule  enter  into  any  chemical  reactions,  and  thus  these  elements 
are  said  to  be  chemically  inactive. 

There  are,  however,  a  number  of  exceptions  to  the  rule  that 
there  are  eight  elements  in  each  period.  The  first  exception  is 
constituted  by  hydrogen  (Z=  1)  and  helium  (Z  =  2)t  which  form  the 
first  period.  In  this  period  there  are  only  two  elements,  and  not 
eight.  This  is  due  to  the  fact  that  the  /<  shell  does  not  include  p 
states.  Consequently,  the  properties  of  these  elements  are  of  a 
dual  nature.  Because  there  is  only  one  electron  in  the  outer  shell, 
hydrogen  should  have  the  same  chemical  and  optical  properties  as 
the  alkali  metals.  Indeed,  just  like  in  these  elements,  the  maximum 
splitting  of  the  spectral  terms  of  hydrogen  is  two,  and  its  valence 
is  one.  Hydrogen,  however,  also  resembles  the  halogen  group  in 
that  it  lacks  just  one  electron  for  a  complete  outer  shell,  and  it  can 
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therefore  acquire  a  second  electron  forming,  in  the  same  way  as 
the  halogens,  a  negatively  charged  ion. 

Helium  resembles  the  alkaline  earth  metals  of  the  second  group 
in  that  it  also  has  two  electrons  in  the  outer  shell.  The  spectral 
terms  of  both  helium  and  the  alkaline  earth  elements  are  either 
singlets  (spin  0),  or  triplets  (spin  1).  However,  in  its  chemical 
properties,  helium  is  a  typical  representative  of  the  inert  gases, 
because  its  outer  K  shell  is  completely  filled;  hence  it  does  not 
participate  in  normal  chemical  reactions. 

The  maximum  valence  of  elements  is  determined,  as  a  rule,  by 
the  number  of  electrons  in  the  outer  shell;  that  is,  the  valence  of 
atoms  varies  from  unity  (for  atoms  of  the  first  group)  to  seven 
(halogen  group).  There  are,  however,  certain  exceptions  among  the 
elements  in  which  the  inner  shells  are  filled  after  the  outer  shells. 

It  can  be  seen  from  the  periodic  table  that  there  will  be  two 
electrons  in  the  outer  shell  of  all  elements  from  scandium  (Z  =  21) 
to  nickel  (Z  =  28),  with  the  exception  of  chromium,  where  there  is 
only  one.  Owing  to  transitions  of  electrons  from  M  states  to  4p 
states,  however,  the  maximum  valence  of  scandium  (Z  =  21)  is  equal 
to  three,  and  that  of  manganese  (Z  =  25)  is  equal  to  seven.  Conse- 
quently, it  was  necessary  to  place  these  elements  in  groups  corre- 
sponding to  their  maximum  valences  (this  was  correctly  done  by 
Mendeleyev).  Mendeleyev  placed  iron  (Z  =  26)f  cobalt  (Z  =  27)  and 
nickel  (Z  =  28)  in  a  special  group  (Group  VIII).  The  introduction  of 
this  group  is  justified  from  the  point  of  view  of  modern  quantum 
mechanics,  since  at  most  eight  electrons  can  occupy  the  outer 
shell.  In  general,  however,  iron  behaves  either  as  a  bivalent  or 
trivalent  element.  All  of  these  elements  have  similar  properties. 
In  particular,  they  have  distinctive  ferromagnetic  properties  caused 
by  uncompensated  spins  of  the  3d  electrons  in  the  inner  shell.  The 
presence  of  these  states  is  due  to  the  fact  that  from  the  energy 
standpoint,  the  3d  state  is  more  favorable  during  formation  of  the 
crystal  lattice  than  the  other  states  in  which  the  spins  of  the  elec- 
trons can  be  compensated. 

In  the  elements  following  the  ferromagnetic,  the  3d  subs  hell  is 
the  first  to  be  completely  filled;  the  filling  of  the  levels  then  con- 
tinues in  the  4s  and  then  the  4/j>  subshells.  Krypton  completes  the 
structure  of  the  M  shell  (n  =  4) ;  therefore  its  optical  and  chemical 
properties  will  be  characteristic  of  the  inert  gases. 

As  we  have  already  mentioned,  the  fifth  period,  which  extends 
from  the  alkali  metal  rubidium  (Z  =  37)  to  the  inert  gas  xenon 
(Z  =  54),  is  a  repetition  of  the  fourth  period  and  exhibits  no  new 
features. 
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Quantum  theory  also  explains  the  characteristic  properties  of 
the  elements  in  the  lanthanide  series  (the  rare-earth  elements), 
which  comes  immediately  after  lanthanum  and  extends  from  cerium 
(7  =  58)  to  lutetium  (Z  — 71).  The  elements  in  this  series  are 
formed  by  consecutive  addition  of  electrons  to  the  deeper  4/  sub- 
shell  (the  second  inner  N  shell),  even  though  the  first  inner  shell 
(0)  and  the  outer  shell  (P)  are  still  incompletely  filled.  Since  the 
chemical  properties  are  determined  mainly  by  the  electrons  of  the 
outer  shells,  all  14  rare-earth  elements  are  much  closer  with  re- 
gard to  chemical  properties  than  are  the  elements  in  which  the 
first  inner  d  subshell  is  filled. 

For  a  long  time  hafnium  (Z  — 72)  was  also  included  in  the 
lanthanide  series.  A  theoretical  analysis  performed  by  Bohr 
showed,  however,  that  there  can  be  at  most  fourteen  elements  in 
this  group  (the  possible  number  of  /-states)  and  that,  therefore, 
hafnium  must  be  a  chemical  analog  of  zirconium.  Careful  exper- 
iments have  confirmed  this  theoretical  conclusion. 

The  actinide  series  in  the  seventh  and  last  period  is  analogous 
to  the  lanthanide  group.  Beginning  with  thorium  (Z  =  90) ,  the  ele- 
ments of  this  series  are  formed  by  the  consecutive  addition  of 
electrons  to  the  deep- lying  5/  subshell  of  the  0  shell,  while  the  6s, 
6;;  and  7s  subshells  remain  completely  filled  and  the  6d  subshell  is 
partially  filled.  The  actinides  include  protactinium  (Z  =  91), uranium 
(Z  =  92)  and  also  the  following  artificially  produced  transuranium 
elements:  neptunium  (Z  =  93),  plutonium  (Zr^94))americium  (Z=95), 
curium  (Z  =  97),  californium  (Z  =  98),, einsteinium  (Z=  99),  fermium 
(Z=100),  mendelevium(Z=101),  nobelium  (Z=102)  and  the  recently 
discovered  element  103  (lawrencium)? 

The  question  of  how  many  elements  can  be  produced  by  artificial 
means  and  experimentally  detected,  and  the  question  of  where  the 
periodic   system   ends,    have  not  yet  been  finally  answered.  It  is 
clear,  however,  that  the  periodic  system  ends  because  of  he  in- 
stability of  nuclei  (due  mainly  to  their  spontaneous  fission). 


The    discovery   and  properties  of  the  new  elements   are   discussed   in  G.  T.    Seaborg, 
TtiL>  Transuranium  Elements,  Reading,  Mass.,  Addison- Wesley,  1958 


Chapter  27 


The  Theory  of  Simple  Molecules 


A.  BASIC  FORMS  OF  THE  CHEMICAL  BOND 

The  chemical  properties  and  the  optical  spectrum  of  an 
element  are  determined  mainly  by  the  outer  electrons  of  the 
atom.  Therefore,  the  regularities  in  the  structure  of  the  outer 
shell,  which  account  for  the  optical  periodicities,  also  provide 
a  basis  for  the  construction  of  a  theory  of  the  periodically  re- 
current chemical  properties  of  the  elements.  It  should  be  noted 
that  the  chemical  properties,  unlike  the  optical  properties,  are 
not  exhibited  by  isolated  atoms,  but  appear  only  in  the  presence 
of  other  atoms,  with  which  the  atom  forms  chemical  compounds. 

The  inner  electrons  have  almost  no  influence  on  chemical 
processes,  since  they  are  much  more  strongly  bound  to  the 
nucleus  than  the  outer  electrons.  Chemical  reactions  therefore 
liberate  much  less  bonding  than  the  energy  of  the  inner  elec- 
trons. 

In  discussing  the  chemical  properties  of  an  atom,  we  must 
distinguish  between  two  main  types  of  chemical  bonds:  ionic  (or 
heteropolar)  and  atomic  (or  homopolar).  We  shall  consider  both 
of  these  types  in  greater  detail. 


B.    HETEROPOLAR  MOLECULES 

Inorganic  salts  consist  of  positive  and  negative  ions  held  to- 
gether by  an  electrostatic  (Coulomb)  attraction  to  form  a  mole- 
cule. Compounds  of  this  type  are  called  ionic,  and  their  molecules 
are  said  to  be  heteropolar.  The  ions  may  be  either  positive  or 
negative.  The  sign  of  the  charge  on  the  ion  depends,  on  the  one 
hand,  upon  the  ionization  potential,  that  is,  the  energy  that  must 
be  expended  in  order  to  remove  an  electron  from  the  outer 
shell;  and,  on  the  other  hand,  on  the  electron  affinity,  that  is, 
the  energy  which  the  atom  must  acquire  to  hold  an  additional 
electron  in  the  outer  shell. 

Let  us  assume  that  a  neutral  atom  with  atomic  number  Z 
contains  N  electrons  in  the  inner  orbitals  and  Za  —  Z  —  N  elec- 
trons in  the  outer  orbitals.  The  electrons  in  the  inner  orbitals 
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will  completely  screen  a  corresponding  fraction  of  the  nuclear 
charge,  but  will  do  so  only  in  the  region  outside  this  inner  shell 
(starting  at  the  outer  shell).  Thus,  the  Coulomb  potential  energy 
holding  the  electrons  in  the  outer  shell  is 

y  _      zae* 


However,  inside  the  atomic  core  the  charge  will  be  Z,>Zrt;  that 
is,  the  screening  of  the  nuclear  charge  will  not  be  complete  (see 
Chapter  25).  In  exactly  the  same  way,  the  outer  electrons  will 
completely  screen  the  remaining  part  of  the  nuclear  charge  (Zrte0 ) 
only  in  the  region  outside  the  outer  shell  (that  is,  only  in  the  case 
of  excited  states).  While  in  this  case  there  appear  polarization 
forces  proportional  to  r~B,  they  are  not  able  to  hold  an  additional 
electron.  In  the  outer  shell  itself  the  charge  will  be  incompletely 
compensated;  for  this  reason  (but  provided  there  are  unfilled 
states  in  the  outer  shell),  the  incompletely  screened  part  of  the 
nuclear  charge  will  hold  additional  electrons  in  this  shell,  thus 
forming  a  negative  ion.  The  rule  is  that  the  less  electrons  there 
are  in  the  outer  shell  of  a  neutral  atom,  the  larger  the  total 
screening  of  the  nuclear  charge  in  this  shell.  Therefore,  an  alkali 
metal  will  lose  the  one  electron  in  its  outer  shell  more  readily 
than  it  will  acquire  additional  electrons. 

A  curve  showing  the  dependence  of  the  ionization  potential  on 
Z  is  plotted  in  Fig.  27.1.  It  shows  a  minimum  for  alkali  metals  and 
a  maximum  for  inert  gases.  This  curve  reproduces  rather  faith- 
fully the  periodicity  exhibited  by  the  number  of  electrons  in  the 
outer  shell. 

In  inert  gases  the  ionization  potential  reaches  its  largest 
value;  the  removal  of  an  electron  from  the  outer  shell  and  its 
transfer  to  another  atom  require  a  very  large  expenditure  of 
energy.  In  addition,  no  further  electrons  can  be  held  in  the  outer 
shell,  which  is  completely  filled.  Therefore,  inert  gases  do  not 
participate  in  ordinary  heteropolar  compounds  (we  shall  also  see 
that  they  do  not  form  homopolar  bonds),  and  hence,  as  a  rule, 
they  exist  as  unassociated  atoms. 

Atoms  of  alkali  and  alkaline  earth  metals  readily  give  up  their 
valence  electrons  to  another  atom  (the  ionization  potential  is  at 


For  example,  in  sodium  ('/,  11),  the  ten  electrons  in  the  inner  shell  completely 
screen  ten  units  of  nuclear  charge,  which  leaves  only  the  eleventh  unit  of  nuclear  charge 
to  be  (partially)  screened  by  the  outer  electron.  In  chlorine  (Z  -=  17)  the  ten  inner  elec- 
trons completely  screen  only  ten  units  of  nuclear  charge,  so  that  the  seven  electrons  of 
the  outer  shell  must  screen  the  remaining  charge,  which  they  can  accomplish  only  partially. 
Therefore,  a  chlorine  atom  is  able  to  hold  an  additional  electron  more  easily  than  sodium, 
and  is  thus  converted  into  the  negative  ion  Cl  .  On  the  other  hand,  a  sodium  atom  gives 
up  its  outer  electron  more  readily,  and  in  this  way  forms  a  positive  ion  Na+. 
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its  minimum  here),  and  thus  convert  to  positive  ions  (for  example, 
a  Na+  ion). 


Em 


10       20      30       40      50      60      70      SO      90  Z 


Fig.   27.1.  The  dependence  of  the  lomzation  po- 
tential  of  a   neutral    atom  on   the   atomic  number 

On  the  contrary,  atoms  in  Group  VI  (including  oxygen)  and  in 
Group  VII  (halogens),  and  also  hydrogen  (which  resembles  Group 
VII  with  regard  to  the  number  of  missing  electrons),  have  a 
higher  electron  affinity  than  the  other  elements  (see  Table  27.1). 
The  electron  affinity  of  sodium  is  practically  equal  to  zero,  like 
that  of  inert  gases. 

Table  27.1 
Electron  Affinity 


Element 

Electron  affinity 
(ev) 

H 
F 
Cl 
0 

0.71 
4.13 
3.72 
3.07 

The  first  successful  attempt  to  construct  a  theory  of  the  ionic 
bond  was  due  to  Kossel  (1916),  who  made  use  of  the  Bohr  theory 
of  the  atom. 

Kossel's  theory  was  based  on  the  fact  that  the  eight- electron 
shells  of  the  inert  gases  are  closed,  so  that  these  atoms  have 
zero  valence.  Positive  valence  (or  valence  with  respect  to  hydro- 
gen) is  determined  by  the  number  of  electrons  in  the  outer  shell 
in  atoms  in  which  these  electrons  are  readily  lost  (atoms  of 
Groups  I  or  II).  Negative  valence  (valence  with  respect  to  fluorine 
or  twice  the  valence  with  respect  to  oxygen)  is  determined  by  the 
number  of  electrons  which  the  atom  can  acquire,  that  is,  the 
number  of  vacant  states  in  the  outer  shell  (see  Chapter  26). 
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Negative  valence  is  particularly  pronounced  in  elements  of  Groups 
VI  and  VII,  although  both  types  of  valence  may  be  exhibited  by  a 
given  element.  For  example,  in  the  typical  heteropolar  compound 


Fig.  27.2.  Two  neutral  independent  atoms 
(Na  and  Cl).  The  black  dots  indicate 
electrons,  the  light  dot  indicates  a  state 
which  can  be  occupied  by  an  electron 
owing  to  the  electron  affinity  of  the  atom. 

HClt  chlorine  has  a  valence  of  -lf  although  other  compounds  are 
possible  in  which  chlorine  has  a  valence  of  +7.  An  example  of 
the  latter  is  C12O7.  We  do  not  intend  here  to  develop  a  complete 
theory  of  the  chemical  bond,  and  shall  restrict  ourselves  to  an 
examination  of  one  typical  ionic  molecule,  namely,  the  hetero- 
polar molecule  Nad.2  The  energy  bonding  an  outer  electron  in 
atomic  sodium  is  5.1  ev.3  When  the  valence  electron  of  sodium  is 
transferred  to  the  outer  shell  of  the  chlorine  [that  is,  during 
formation  of  Na+  and  Cl"  ions  (see  Fig.  27.2)],  it  carries  some- 
what less  bonding  since  its  affinity  for  chlorine  (W&  =  3.7  ev) 
is  somewhat  lower  than  the  ionization  energy  of  sodium  ( W  ^  = 

5.1  ev).  However,  in  the  formation  of  the  molecule,  this  deficiency 
is  compensated  by  the  Coulomb  energy  of  attraction  between  the 
Na+  and  Cl~  ions  (see  Fig.  27.3). 

The  total  energy  bonding  these  atoms  in  the  NaCl  molecule  is 
given  by  the  expression 


•rylOn 

—  M^Na   • 


This  energy  has  been  very  carefully  determined  from  experiment 
and  found  to  be  equal  to:   UP$dCI  =  4.2  ev-    Hence  for  the  Coulomb 


2See  G   Herzberg,  Atomic  Spectra  and  Atomic  Structure,  New  York,  1944 

The  bonding  energy  of  an  electron  in  an  atom  (or  molecule)  is  equal  to  the  energy  that 
must  be  expended  in  order  to  remove  an  electron  It  is,  therefore,  equal  to  the  negative  of 
the  energy  holding  the  electron  in  the  compound  (W  -  -  V),  that  is,  it  will  be  a  positive 
quantity 
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7{f  _|_  u/N°an  -f  lP?Xa  =  5.6  ev: 


Since  VCoul=— U?Coul=  -~^,    we  obtain  a  perfectly  reasonable  value 

for  the  interatomic  distance  in  the  NaCl  molecule:  R=  2.5  -  10~8 
cm. 


(3.7  ev) 


Fig.  27  3  The  formation  of  an  NaCl  molecule 
from  Na^  and  Cl~  ions  The  lonization  poten- 
tial of  sodium  (5  1  ev)  and  the  affinity  of  the 
chlorine  atom  for  an  electron  (3  7  ev)  are  indi- 
cated in  parentheses  The  Coulomb  bonding 
energy  between  the  ions  in  the  molecule  is 
5  6  ev. 

It  should  be  noted  that  we  have  not  considered  here  all  the  in- 
teractions that  occur  in  a  heteropolar  molecule.  In  addition  to  the 
Coulomb  forces  of  attraction,  there  will  also  be  repulsive  forces; 
these  exceed  the  Coulomb  forces  at  small  distances  and  prevent 
the  two  atoms  from  approaching  closer  than  the  distance  R.  In 
any  case,  this  elementary  discussion  explains  the  principal  physical 
processes  involved  in  the  formation  of  heteropolar  molecules;  it 
also  explains,  however  qualitatively,  the  ionic  structure  of  their 
crystal  lattice  and  the  dissociation  of  these  molecules  into  indi- 
vidual ions,  a  process  which  occurs  in  solutions. 


C.  THE  MOLECULAR  HYDROGEN  ION 

Aside  from  ionic  compounds,  there  exist  molecules  formed 
directly  from  the  neutral  atoms,  rather  than  from  ions.  The  simplest 
representatives  of  these  molecules  are  H2,  O2  and  Ns.  These  are 
called  homopolar  molecules. 

The  formation  of  homopolar  molecules  cannot  be  explained 
on  the  basis  of  classical  theory  or  Bohr's  semiclassical  theory. 
These  theories  are  useful  only  for  compounds  held  together  by 
electrostatic  forces  such  as,  for  example,  ionic  compounds. 
Before  discussing  the  formation  of  homopolar  molecules,  let  us 
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consider  a  very  simple  case,  namely,  the  molecular  hydrogen  ion 
Ha  ,  which  consists  of  two  hydrogen  nuclei  and  a  single  electron. 
This  analysis  is  important  for  methodological  reasons,  because  it 
enables  us  to  express  in  comparatively  simple  mathematical  form 
the  features  of  the  bond  that  arises  between  two  hydrogen  nuclei 
owing  to  the  exchange  of  an  electron  (exchange  forces).  The  same 
forces  also  appear  in  the  homopolar  hydrogen  molecule  H2. 

Let  us  denote  the  distance  between  the  two  hydrogen  nuclei  a 
and  d  by  R  and  assume  that  R  changes  adiabatically;  that  is,  R 
changes  so  slowly  that  it  can  be  regarded  as  a  constant  in  solving 
the  Schrodinger  equation.  In  a  more  exact  treatment,  it  is  neces- 
sary to  take  into  account  the  vibrations  of  the  nuclei  about  the 
equilibrium  position  (the  vibrational  spectrum),  and  the  rotation 
of  the  nuclei  about  the  center  of  mass  (the  rotational  spectrum), 
These  questions  have  been  treated  in  detail  in  Chapter  12. 

Suppose  r  and  S  represent  the  distances  between  the  electron 
and  the  nuclei  a  and  d  ,  respectively.  The  Schrodinger  equation 
for  the  ionized  hydrogen  molecule  can  then  be  written  as 

(£  —  H)«p  =  0.  (27.1) 

Here  the  Hamiltonian  is 

H-T--f  -  *?+-5[,  (27.2) 

and  the  kinetic  energy  operator  has  the  form 

i    /  ft     V2       i    /  n    ,  \« 

(27.3) 


where  v.v  =  vi  (V,  =  -~'   vi  =  -r)  slnce 


(27.4) 


and  R  may  be  regarded  as  a  constant  in  the  problem  under  con- 
sideration. 

We  shall  restrict  ourselves  to  an  investigation  of  the  ground 
state,  and  carry  out  our  calculations  with  the  help  of  perturbation 
theory.  The  spin  effects  can  be  neglected  in  the  case  of  a  single 
electron. 

In  the  zeroth  approximation  we  assume  that  the  electron  is 
under  the  influence  of  either  nucleus  a  or  nucleus  a'  (see  Fig.  27.4). 
The  Schrodinger  equations  describing  these  two  possible  unper- 
turbed states  are 


(27.5) 
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Both  eigenvalues  and  eigenf unctions  are  identical  and  correspond 
to  the  Is  state  of  the  hydrogen  atom.  Since  one  of  the  wave 
functions  is  associated  with  nucleus  a,  and  the  other  with  nucleus 
a',  we  may  write  (/i=l,  /  =  m  =  0) 


=       (r)~~      =       (?)  (2?*6) 

where  fynim  =  fym  is  the  wave  function  of 
the  ground  state  of  the  hydrogen  atom 
[see  (13.32)],  namely, 


The   total  energy  of  the  system  and  the 
zero-order  eigenfunction  are  as  follows: 

r-«\  r«  r**« 

(27.8) 


The  uncertainty  in  6°  is  due  to  the  fact 
that  the  presence  of  two  nuclei  leads  to  a 
degenerate  state  of  the  system.  In  solving 
(27.1)by  perturbation  theory,  we  must  set 


Electron  1  is  near 
nucleus  Q 


(27.7)        *^- 


a  R  a1 

Electron  1  is  near 

nucleus  a' 


Fig.  27.4.  Diagram  of  the  inter- 
action of  particles  in  the  H2 
molecular  ion  The  solid  arrows 
represent  the  interaction  of  the 
particles  m  the  zeroth  approxi- 
mation The  arrows  depicted  by 
a  dashed  line  represent  the  in- 
teraction corresponding  to  the 
perturbation 


(27.9) 


Substituting  (27.9)  into  (27.1)  and  restricting  ourselves  to  the  first- 
order  quantities,  we  find 


(27.10) 


For  the  solution  $a  (the  electron  near  nucleus  a),  --^  represents  the 
main   interaction,   and  -  ii  the  perturbation.    On  the  contrary,  for 

*"  p2 

the  solution  <|>a,  (the  electron  near  a'),  the  main  interaction  is-yl, 
and  the  perturbation  is  -  —  . 

From  Eq.  (27*10)  we  can  find  the  additional  energy  £',  and 
also  a  relationship  between  the  coefficients  Ct  and  C2  in  the  wave 
function  ^°.  This  can  be  done  because  the  perturbation  energy 
removes  the  degeneracy  (just  as  in  the  case  of  the  helium  atom). 
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To  solve  this  problem,  we  can  make  use  of  the  theorem  which 
states  that  the  solution  of  the  homogeneous  equation  [that  is,  one 
of  the  solutions  of  Eqs.  (27.5)]  must  be  orthogonal  to  the  right-hand 
side  of  the  same  equation,  that  is,  Eq.  (27.10).  Assuming  that  the 
electron  is  near  the  nucleus  a,  we  can  neglect  the  perturbation 

energy  £?   on  the  left-hand  side  of  the  equation,  since  it  gives  us 

a  second-order  term  when  multiplied  by  <]/.  Then  the  solution  of 
the  homogeneous  equation  will  be  the  function  fya  and  according  to 
the  theorem  we  have  just  stated,  it  must  be  orthogonal  to  the  right- 
hand  side  of  Eq.  (27.10) 

-|-?;-^)t|fd'v  =  0     (27.11) 

Here  we  have  used  the  fact  that  the  wave  function  <j/(I  of  the  ground 
state  of  the  hydrogen  atom  is  real. 

In  exactly  the  same  way,  assuming  that  the  electron  is  near  the 
nucleus  a',  we  may  neglect  the  perturbation  energy  on  the  left- 
hand  side  of  Eq.  (27.10)  (in  this  case  the  perturbation  energy  is  -*  ). 

Since  the  solution  of  the  homogeneous  equation  will  now  be  the 
function  fya,9  we  obtain  a  second  equation  for  the  unknown  quantities 


(27.12) 


When  r  is  replaced  by  /,^,  becomes  <|>a,f  whereas  in  the  reverse 
substitution  <J>a,  becomes  6rt.  In  both  cases,  the  volume  element 
d*x  remains  unchanged  (d*x  —  d*tf).  Therefore,  we  can  reduce 
(27.12)  to  a  form  that  is  in  agreement  with  (27.11),  but  with  the 
coefficients  C\  and  C2  interchanged 


-]-  C,       «[,„  {£'  -j-  '   -       t0,d'x  =  0.  (27.13) 


In  further  transforming  Eq.  (27.11),  we  must  not  forget  that  the 
function  <|»0  is  normalized  to  unity 


1,  (27.14) 

but  is  not  orthogonal  to  the  function  t|»0« 

(27.15) 
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This  follows  from  the  fact  that,  although  the  expression  S(R) 
vanishes  for  R  ->  oo  (there  will  be  no  points  where  the  two  functions 
<l>a  and  tya,  both  differ  from  zero),  it  becomes  equal  to  unity  for 

«  =  0(^  =  ^f)- 

Let  us  introduce  the  notation 


and 


where  K  is  the  Coulomb  energy  of  interaction  between  the  hydrogen 
atom  and  the  (atomic)  hydrogen  ion,  and  A  is  the  exchange  energy, 
which  has  no  classical  analog.  This  energy  arises  because  the 
electron  can  be  in  both  states  tya  and  tya,  simultaneously  (that  is, 
exchange  occurs  between  states  a  and  d  ).  Formally,  the  existence 
of  the  exchange  energy  is  reflected  in  the  fact  that  the  expression 
for  t|>  contains  <jv,  as  well  as  <|>a.  As  we  shall  see  in  what  follows, 
it  is  this  exchange  energy  which  gives  rise  to  an  attractive  force 
between  the  nuclei.  At  certain  values  of  the  internuclear  separation, 
this  force  exceeds  the  force  of  repulsion,  and  as  a  result  the  Ha 
molecular  hydrogen  ion  is  formed.  It  is  worth  noting  that  this 
mixed  state  cannot  arise  in  the  Bohr  theory,  and  therefore  the 
existence  of  an  ionized  hydrogen  molecule  can  be  explained  only 
in  terms  of  quantum  mechanics. 

With   the   help   of    (27.14)-  (27.  16),  we  can  reduce  Eqs.  (27.11) 
and  (27.13)  to  the  form 


G!  (E'S- 

obtaining  two  solutions:  a  symmetric  one  (Cl==C2)  and  an  antisym- 
metric one  (Ci  =  —  Q).  The  symmetric  solution  is 


(27.18) 
K^t 

and  the  antisymmetric  solution  is 

_    (ft      -  £      \ 

y<l     v    ' 

(27.19) 
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The  factor     f     .      ~-  is  the  normalization  coefficient.  We  may  note 
V  *  (i  ±  &) 

here  that  the  symmetric  and  antisymmetric  solutions  are  already 
mutually  orthogonal 


The  quantity  /:'  may  be  regarded  as  an  additional  interaction  V  (/?), 
which  binds  the  H  atom  and  the  H+  ion  into  a  stable  hydrogen 
molecule.  In  order  to  find  the  specific  form  of  this  interaction  as 
a  function  of  R,  we  must  calculate  the  values  of  ,S,  K  and  A.  For 

g—ktf 

this  purpose  we  use  the  well-known  expansion  of  the  function  — 
into  a  triple  Fourier  integral 


i    r*  eikr  d*k 
~  =  W  J  F+lfcjf  '  (27.20) 


After  differentiating  this  expression  with  respect  to  the  parameter 
k;},  we  can  easily  write  the  wave  function  for  the  ground  state  of  the 
hydrogen  atom  in  the  form  of  a  Fourier  integral 


where  /e0  =  -. 


In  accordance  with  (27.15),  the  expression  for  5  becomes 

S  =  J  tioo  (r)  tioo  (r  -  K)  d*x  (27.22) 


We  may  replace  <|>iM(r)  and  ^Wo  (/"  —  /?)  by  their  expressions  in 
terms  of  the  Fourier  integral  (27.21).  Integrating  with  respect  to 
volume,  we  obtain 


(27.23) 
and  thus  we  find  the  following  value  for  5: 

—  TIB  j  (V  -f  A*)4  c    "7 

(27.24) 
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The  integral  in  this  equation  can  be  calculated  with  the  aid  of  Eq. 
(27.20),  which  should  be  differentiated  three  times  with  respect  to 
the  parameter  /?; . 

In   order   to   calculate  the  additional  Coulomb  interaction,  we 
shall  use  the  relation 


1  b\      P         0 

=  4*->"«.  =&  J  jp 


where  we  must  set  ^  =  ->  Then,  putting  &0=  0  in  (27.20),  we  obtain 


<27-25> 


In  exactly  the  same  way,  we  may  represent  the  quantity  \/R  in  the 
form  of  the  integral  (27.25).  Substituting  these  quantities  into 
formula  (27.16)  and  integrating  over  the  volume  dAx  with  aid  of 
(27.23),  we  find 


K=  £  J  ^^(^  +  ^_).  (27.26) 


9 

Hence,  using  (27.20)  and  setting  &0=— ,  we  obtain  an  egression 
for  the  additional  Coulomb  energy 

off 


(27.27) 
In  calculating  the  exchange  interaction 

A  =  ^  S  -  el  J  i^-®  )m (r-  /?)  £pjc  (27.28) 

we  may  use  (27.21)  for  fj,,M(/—*)  and  (27.20)  for  ±^L 

tioo(r)_  *:/ 
""" 


where  ^o  =  ~.  Hence 


pi  4/?V8    (*    c      d*k 

==~S         £±  j  fjrrrjfj).  = 
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Substituting  the  values  found  for  5,  K  and  A  into  the  expression 
for  the  interaction  energy  [see  (27.18)  and  (27.19)],  we  obtain  the 
following  expression  (Ri'a1t  =  k): 


.  —   ..2  L,-S 


(27.30) 


Here   the  upper  signs  (-)  refer  to  the  antisymmetric  solution  Va, 
while   the   lower   signs    (+)  refer  to  the  symmetric  solution.  For 

small  values  of  R  (R  -  a0)t  we  have 


=  Vs  = 


a,  t  (27.31) 


Fig.  27.5.  Curves  of  the  interaction 
energy  m  the  molecular  hydrogen 
ion  as  a  function  of  the  distance  H 
between  the  nuclei  (in  units  of  OQ) 
for  the  symmetric  (V)  and  anti- 
symmetric (Va)  states 


and  thus  the  energy,  as  we  would  expect, 
is  determined  by  the  Coulomb  energy  of 
repulsion  between  the  two  nuclei.  For 
large  distances  (/?>a0)  we  have 


(27.32) 


that  is,  the  antisymmetric  solution  (-1-) 
gives  a  repulsion,  whereas  the  sym- 
metric solution  (-)  gives  an  attraction. 
The  general  nature  of  the  varia- 
tion of  Vcl  and  l/s  as  a  function  of  R  is 


depicted  in  Fig.  27.5  which  also  shows 
the    experimental    data    on    the   inter- 
action   energy.    It   can  be    seen  from 
this    figure    that    only    the    symmetric 
state  is  realized  in  practice. 
Theoretical  values  obtained  from  the  graph  give  the  equilibrium 

distance    as    /v>,^  2.50a0^  1.32   A,    and  therefore   the   ionization 

energy  is 


D  ---   -  V  (/?„)  =  -f  0.0040  $  =  1 .76  ev. 

The  corresponding  experimental  values  are 

^l.OiiA,     Z)exP  =2  79  ev 


(27.33) 


(27.34) 

(the    zero-point    energy    of    oscillations    is    not   included   in  the 
theoretical  and  experimental  values  given  here). 
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The  discrepancy  between  the  theoretical  and  the  experimental 
data  is  due  to  the  fact  that  here,  just  as  for  the  helium  atom,  the 
perturbation  energy  is  commensurable  with  the  energy  of  the 
zeroth  approximation.  Solving  this  problem  by  the  variational 
method,  using  a  test  function  of  the  form 


where  Z',  the  effective  charge  of  the  nucleus,  is  taken  as  the 
variational  parameter,  we  can  obtain  values  for  /?0  and  D  which  are 
in  considerably  better  agreement  with  experiment 

/?¥«•=,  it06  A,    Dvar=2.25  ev. 

If  several  parameters  are  introduced,  the  variational  method  gives 
results  that  are  in  practically  complete  agreement  with  the 
experimental  data. 

It  can  be  seen  that  the  formation  of  the  molecular  hydrogen 
ion  is  essentially  due  to  the  quantum- mechanical  exchange  forces, 
which  in  the  symmetric  state  give  rise  to  a  stable  molecule. 
From  the  physical  point  of  view,  this  can  be  explained  as  follows. 
The  probability  of  the  electron  being  in  the  symmetric  state  is 


(27.36) 
whereas  the  probability  for  the  antisymmetric  state  is 

(27-37) 

If  we  plot  the  curves  of  constant  probability  density  of  the  electron 
(see  Fig.  27.6),  we  see  that  the  electron  tends  to  be  located  at  the 
midpoint  of  the  line  joining  the  two  nuclei  in  the  case  of  the  sym- 
metric solution,  whereas  in  the  case  of  the  antisymmetric  solution 
the  position  probability  Danishes  at  this  point.  Since  the  electron 
binds  the  two  nuclei  most  strongly  when  it  is  halfway  between  them, 
it  is  natural  to  expect  that  the  first  solution,  and  not  the  second, 
will  lead  to  the  formation  of  a  molecule.  Moreover,  in  the  case  of 
the  symmetric  solution,  the  curves  showing  the  electron  distribution 
about  the  nuclei  tend  to  merge  when  the  nuclei  approach  one  another; 
this  provides  a  graphical  characterization  of  the  homopolar  bond.4 


4For  more  details   see   P.   Gombas,   Theorie   und  Losungsmethodcn  des  Mehrte  lichen- 
problems  der  Wellenmechantk,  Basel    Birkhauser-Verlag,  1950. 
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Two  hydrogen  nuclei,  or  a  hydrogen  and  a  deuterium  nucleus,  can 
also  be  linked  by  other  particles  besides  electrons.  We  may 
mention  in  this  connection  the  u-  me  sic  molecule  (HD)1,  in  which  the 
bonding  between  the  hydrogen  and  deuterium  nuclei  is  brought 
about  by  a  negative  n  meson.  Alvarez  produced  a  mesic  molecule 
of  this  type  bypassingnegative^  mesons  through  a  bubble  chamber. 
The  radius  of  such  a  molecule,  as  calculated  from  the  equation 

/? i=2.5  --l--p,   will  be  1/200  the  radius  of  the  molecular  hydrogen 

' 


Symmetric 
state 


Antisymmetric 
state 


Fig.  27.6.    Electron   density    distribution   in    the  molecular 
hydrogen  ion. 


ion,  since  the  mass  of  the  u,  meson  is  approximately  1/200  as 
large  as  the  electron  mass.  Thus,  when  the  nucleus  of  the  hydro- 
gen atom  approaches  the  nucleus  of  the  deuterium  atom,  they 
form  a  common  nucleus,  namely  that  of  the  >IIe*  molecule 


As  a  result,  an  energy  of  5.4  Mev  is  released  and  carried  away  by 
the  n  meson.  Thus  the  n  meson  acts  almost  as  a  catalyst  of  the 
nuclear  reaction. 


D.  HOMOPOLAR  ATOMIC  MOLECULES 

The  first  successful  attempt  to  give  a  theoretical  explanation  of 
the  homopolar  molecule  was  made  by  Heitler  and  London  (1927) 
with  the  help  of  quantum  mechanics.  In  a  homopolar  molecule, 
exchange  forces  play  a  fundamental  role.  In  their  treatment, 
Heitler  and  London  used  perturbation  theory,  which  does  not  give 
completely  accurate  quantitative  results.  Although  more  accurate 
quantitative  results  can  be  obtained  by  means  of  the  variational 
method,  the  Heitler-London  theory  enables  us  to  bring  out  in  a 
very  simple  way  the  physical  features  of  the  homopolar  bond. 

Let  r\  and  rL  denote  the  position  vectors  of  the  first  and  second 
electrons  relative  to  nucleus  a,  and  r\  and  r^  the  position  vectors 
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of  the  electrons  relative  to  nucleus  a'  (see  Fig.  27.2).  Then 
r\~ri~  /?,  ra  =  r2  —  R.  In  the  zeroth  approximation  we  obtain  two 
wave  functions  which  are  products  of  the  ground-  state  hydrogen 
wave  functions 

*aa>  =  ta  (^i)  *„'  W)  =  <t>100  (r,)  <)1M  (r,  —  /?), 

*a'«  =  »«'  (r;)  *a  (r,)  = 


The  first  solution  <!>aa,  corresponds  to  the  case  when  the  first  electron 
is  near  the  nucleus  a  (and  the  second  electron  near  the  nucleus  a'), 
while  the  second  solution  tya,a  corresponds  to  the  case  when  the 
first  electron  is  near  the  nucleus  a',  and  the  second  electron  is 
near  the  nucleus  a.  Both  these  possibilities  are  depicted  in  Fig. 
27.7,  where  the  solid  arrows  show  the  atomic  bonds,  and  the 
dashed  lines  show  the  molecular  bonds.  When  the  distance  between 
the  nuclei  tends  to  infinity  (/?->oo)f  all  molecular  bonds  vanish. 


a1 


Fig.  27.7.  Diagram  of  the  interactions 
in  the  H2  molecule  The  solid  lines 
join  the  particles  whose  interaction  is 
taken  into  account  in  the  zeroth  ap- 
proximation The  dashed  lines  denote 
interactions  which  are  regarded  as 
perturbations,  a  and  a'  are  the  nuclei 
of  the  hydrogen  atoms,  1  and  2  are 
electrons 

Just  as  in  the  problem  of  the  molecular  hydrogen  ion,  the  main 
solutions  (27.38)  that  give  rise  to  an  additional  degeneracy  of  the 
system  will  not  be  orthogonal5 

,.»a^d^i^i={ J  ^oo  (r)  4,0,,  (r  -  /?)  fx  }a  =-.  5 ',     (27.39) 


5A  bar  over  a  symbol  denotes  quantities  referring  to  the  neutral  molecule. 


452  RELAT1VISTIC    QUANTUM    MECHANICS 

where    S   for   the  HJ   ion   is  given  by  the  expression  (27.34).  The 
Coulomb  energy  of  interaction  of  the  two  atoms  is  given  by 


=  j  tf,  {f  +  £-"#  -3}**.  d^,  (27.40) 


where  the  first  and  second  parenthetic  terms  in  the  integrand  cor- 
respond to  the  potential  energy  of  repulsion  between  the  two  nuclei 
and  the  two  electrons,  and  the  third  and  fourth  terms  correspond  to 
the  potential  energy  of  attraction  between  the  first  electron  and  the 
nucleus  a'  and  between  the  second  electron  and  the  nucleus  a. 
In  exactly  the  same  way,  we  obtain  the  exchange  energy 


A  -  J 


<27-41> 


The  expressions  for  K  and  A  can  be  computed  aproximately  by 
the  same  method  as  in  the  theory  of  the  H|  ion.  For  K  we  obtain 
a  comparatively  simple  result,  and  for  A  a  more  complicated 
result,  since  A  is  expressed  in  terms  of  an  integral  logarithm  (as 
shown  by  Sigura).  The  general  character  of  the  solution,  however, 
remains  the  same  as  in  the  theory  of  the  1  1]  ion.  In  particular, 
the  main  forces  which  hold  the  two  neutral  atoms  in  the  molecule 
are  the  exchange  forces.  These  forces  have  a  minus  sign  at 
comparatively  large  interatomic  distances  and  correspond  to  the 
mutual  attraction  of  the  atoms.  Just  as  in  the  case  of  the  molecular 
hydrogen  ion,  we  have  two  solutions.  The  first  solution  is  sym- 
metric 


2(l|-5)  (27.42) 


and  the  second  is  antisymmetric 

*a=i7-1—  -(*«B. 

1/2(1-  S) 


(27.43) 


The  general  form  of  the  curves  of  Va"s  for  a  neutral  hydrogen  mole- 
cule is  approximately  the  same  as  for  the  H^  molecular  ion;  there- 
fore, only  symmetric  solutions  will  give  stable  molecules.  For  the 
radius  corresponding  to  the  equilibrium  position  [that  is,  the 
minimum  of  the  potential  energy  Vs  (R)  for  the  symmetric  solution], 
we  obtain  /?„—  1.51a0~  0.80  A.  The  corresponding  value  for  the 
dissociation  energy  is 

=  0.115=3.2  ev.  (27.44) 
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The  experimental  values  of  these  quantities  are 

fl0c*P  =  0.74A,    De*P  r=4.73ev.  (27.44a) 

We  have  omitted  here  the  zero-point  energy  of  oscillations,  0.27  ev 
(see  Chapter  12),  from  both  the  theoretical  and  the  experimental 
values. 


E.    SPIN  AND  THE  SYMMETRY  OF  STATES 

In  the  HS  ion  and  the  H  atom,  there  is  only  one  electron,  and  its 
spin  leads  only  to  insignificant  spin-orbit  interactions.  On  the 
other  hand,  there  are  two  electrons  in  the  H2  configuration,  and  the 
spin  plays  an  important  role  in  the  theory  of  this  molecule,  even 
though  the  spin-orbit  and  the  spin-spin  interactions  give  only 
small  corrections.  In  the  hydrogen  molecule,  just  as  in  the  helium 
atom,  the  mutual  orientation  of  the  spins  of  the  two  electrons 
determines  the  type  of  symmetry  of  the  spatial  part  of  the  wave 
function;  this  is  of  primary  significance  in  connection  with  the 
stability  of  a  molecule.  We  shall  therefore  consider  more  fully 
the  question  of  the  relation  of  the  spin  to  the  symmetry  properties 
of  the  molecule. 

The  total  wave  function  ^  must  contain  a  spin  part  in  addition 
to  the  spatial  part.  When  the  potential  energy  of  the  spin-orbit 
interaction  can  be  neglected,  then,  just  as  in  the  case  of  Russell- 
Saunders  coupling,  the  total  wave  function  can  be  represented  by 
a  product  of  a  spatial  part  and  a  spin  part.  For  electrons  (which 
obey  Fermi  statistics)  the  total  wave  function  must  change  sign 
when  coordinates  and  spins  are  interchanged  (that  is,  the  solution 
must  be  antisymmetric).  We  therefore  have  two  possibilities 


W1==Ca  (5.,  s,)<l>s  (/•„  r*)>  (27.45) 

Ta  =  Cs(slt  sa)<M/-i,  ra).  (27.46) 

It  has  been  shown  in  Chapter  24  that  the  antisymmetric  spin 
function  Ca  describes  two  electrons  with  antiparallel  spins,  and 
therefore  function  <]>s,  which  is  symmetric  in  the  position  coordinates, 
corresponds  to  a  state  with  total  spin  0.  In  exactly  the  same  way 
the  symmetric  spin  function  Cs,  as  well  as  the  antisymmetric 
spatial  function,  describe  a  state  with  total  spin  1  (the  spins  of 
both  electrons  are  parallel).  In  the  case  of  the  hydrogen  molecule, 
the  only  solutions  which  results  in  attraction  is  that  corresponding 
to  <f»s  ;  thus,  a  stable  molecule  is  obtained  only  in  the  case  in  which 
the  electron  spins  are  antiparallel. 

We  shall  now  proceed  to  a  general  analysis  of  the  states  of  a 
molecule,  using  the  symmetry  properties.   In  this  connection,  we 
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note  that  in  diatomic  molecules  the  field  of  force  possesses  axial 
symmetry  with  respect  to  the  line  passing  through  the  nuclei 
(the  symmetry  axis  of  the  molecule).  The  absolute  value  of  the 
component  of  the  total  orbital  angular  momentum  along  this  axis 
of  symmetry  (which,  incidentally,  must  be  conserved)  is  denoted 
by  A .  States  corresponding  to  different  A  are  denoted  by  the 
following  letters:  £  (A  =  0);  II  (A  =  1);  A  (A—  2);  etc. 

In  addition,  each  electronic  state  must  be  characterized  by  the 
total  spin  5  of  all  the  electrons  in  the  molecule.  For  a  given  value 
of  5,  v  —  2S  [  1  states  are  possible.  The  quantity  v,  as  in  the 
case  of  an  atom,  determines  the  multiplicity  of  the  energy  level.  In 
the  case  where  the  total  spin  is  equal  to  zero  (S  =  0) ,  we  have 
v=  1.  For  states  with  S=l,  the  multiplicity  vr=3,  etc.  The 
total  spin  of  electrons  in  a  molecule  can  therefore  be  characterized 
by  the  multiplicity  v,  and  the  corresponding  term  can  be  denoted 
by  VA. 

In  this  notation,  the  symmetric  solution  for  the  spatial  part  of 
the  wave  function  (that  is,  the  solution  ^s)  corresponds  to  the  term 
!S  (that  is,  A=  0,  S  =  0f  v=  1),  while  the  antisymmetric  solution 
(t|)a)  corresponds  to  the  term  8v  (A—  0,  S  =  1,  v= -  3).  It  is  obvious 
that  the  ^  term  corresponds  to  three  states:  in  two  of  the  states 
the  spin  is  directed  along  the  symmetry  axis  of  the  molecule  (in 
a  parallel  or  antiparallel  direction),  while  in  the  third  state  it  is 
perpendicular  to  the  symmetry  axis. 

It  should  be  noted  that  symmetry  plays  a  very  important  role 
in  the  theory  of  molecules  (particularly  in  the  case  of  complex 
molecules).  If,  for  example,  we  reflect  the  wave  function  in  a 
plane  passing  through  the  symmetry  axis  of  the  molecule  (which 
we  take  as  the  z  axis),  the  energy  of  the  molecule  must  remain 
unaltered.6  At  the  same  time,  if  the  component  of  the  orbital 
angular  momentum  or  of  the  spin  along  the  symmetry  axis  differs 
from  zero  (A  ;A0  or  5,^0),  the  rotations  which  are  associated 
with  these  angular  momenta  will  be  reversed  as  a  result  of  this 
reflection.7 

For  simplicity,  we  restrict  our  treatment  to  the  states  in 
which  the  orbital  angular  momentum  is  zero,  that  is,  A— 0  (V 
terms).  In  the  case  where  the  total  spin  of  the  electrons  also 
vanishes,  that  is,  S— 0,  no  change  of  states  will  occur  in  the 
mirror  reflection. 


In  the  case  of  reflection  in  the   r*  plane,  mirror  reflection  amounts  to  a  replacement  of 

v  by  —  v 

As  is  well  known,  the  angular  momentum  L  —  r  x  p  is  an  axial  vector  whose  direction 
is  a  matter  of  convention  (it  has  one  direction  in  a  right-handed  system  of  coordinates, 
and  the  opposite  direction  in  a  left-handed  svstem)  The  direction  of  the  contour  bounding 
the  area  and  constructed  from  the  vectors  r  and  p  remains,  however,  unaltered  in  both  the 
right-  and  left-handed  coordinate  systems 
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If,   however,  the  spins  of  both  electrons  are  parallel  (5  =  1), 
the  following  cases  are  possible. 

(a)  The  component  of  spin  along  the  symmetry  axis  is  equal  to 
zero   (S;~0).    In  this  case,  the  rotation  characterizing  the  spin 
remains  unaltered  as  a  result  of  mirror  reflection  (see  Fig.  27.8, 
where    the    initial    and    reflected   spins    are   characterized   by  a 
rotation    denoted    by    //  and  IT).    The   corresponding  terms  are 
designated  by  the  symbol  %?. 

(b)  The  component  of  spin  along  the  axis  of  symmetry  z  differs 
from  zero  (Ss  =  -•"  1).  In  this  case,  the  rotation  which  we  associate 
with  the  spin  is  reversed  as  a  result  of  mirror  reflection  (see  Fig. 
27.8,  where  the  initial  spin  is  characterized  by  the  rotation  /  and 
the  reflected  spin  by  the  rotation  /'). 


n-n. 


Fig.  27.8.  Change  of  the  angular  momentum  on  reflection 
in  the  AA'B'B  plane,  which  passes  through  the  axis  of  sym- 
metry z  If  the  initial  rotation  characterizing  the  angular 
momentum  occurs  in  a  plane  perpendicular  to  A.\  B  H  (see  /), 
the  direction  of  this  rotation  will  be  reversed  after  reflection 
(see  /')  If,  however,  the  rotation  takes  place  in  the  plane  of 
reflection,  it  will  be  unaltered  by  reflection  (//'-=//) 

The  terms  whose  spin  changes  on  reflection  are  designated  by 
the  symbol  ]£~. 

Therefore,  the  following  terms  of  the  ground  state  of  the  hydro- 
gen molecule  are  possible: 


«2'(A=0,  5  =  0), 

«v;+(A  =  0,  5=1,  S,  =  0), 

•'v-(A  =  0,  S=l,  5,.=  J  1), 


(27.47) 


where  the  last  term  is  obviously  twofold  degenerate. 

If  the  molecule  consists  of  two  identical  atoms,  there  will  be  an 
additional  symmetry  property.  A  diatomic  molecule  with  identical 
nuclei  must  have  a  center  of  symmetry,  in  addition  to  a  plane  of 
symmetry.  This  center  of  symmetry  is  the  mid-point  on  the  line 
joining  the  nuclei.  In  Fig.  27.8,  it  is  located  at  the  origin  of  the 
coordinate  system,  that  is,  at  the  point  2=0.  In  this  symmetry 
transformation  we  must  change  the  sign  of  the  coordinates  of  all 
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the  electrons.  In  particular,  under  this  symmetry  transformation 
the  positions  of  electron  1  and  electron  2  will  be  interchanged  in 
the  hydrogen  molecule  (the  coordinates  of  the  nuclei  are  left  un- 
changed). The  symmetric  wave  function  6s  will  remain  unaltered; 
that  is,  it  is  even  (this  is  denoted  by  the  subscript  g)m  The  anti- 
symmetric function  y  changes  its  sign;  that  is,  it  is  odd  (this 
is  denoted  by  the  subscript  u).  The  main  possible  states  of  the 
hydrogen  molecule,  taking  into  account  both  symmetry  properties, 
can  be  therefore  denoted  as  follows: 

IV1    3V"    <JV~ 

liv  2^«»  JL« 

and  so  forth. 

The  importance  of  symmetry  with  regard  to  the  formation  of  a 
molecule  follows  also  from  the  fact  that  the  ground  state  of  most 
diatomic  molecules  is  a  state  in  which  the  wave  function  is  invariant 
under  all  symmetry  transformations.8  Thus,  ^  is  the  main  term 
of  the  hydrogen  molecule.  The  question  of  molecular  symmetry, 
however,  lies  outside  the  scope  of  this  book. 

It  should  be  noted  that  in  stable  states  of  the  hydrogen  molecule 
the  spins  of  the  two  electrons  are  always  oppositely  directed.  At 
the  same  time,  there  are  two  types  of  hydrogen  molecules— para- 
hydrogen  and  orthohydrogen.  These  names  refer  to  the  orientation 
of  the  nuclear  spins,  and  not  to  the  orientation  of  electron  spins. 
In  parahydrogen  the  spins  of  the  nuclei  are  antiparallel,  while  in 
orthohydrogen  they  are  parallel.  Since  the  number  of  possible 
states  for  two  particles  with  parallel  spins  is  three  times  larger 
than  in  the  case  of  particles  with  antiparallel  spins,  ordinary  hydro- 
gen at  room  temperature  will  consist  of  an  equilibrium  mixture  of 
25%  parahydrogen  and  75%  orthohydrogen.  As  the  temperature  is 
lowered  in  the  presence  of  a  catalyst  (for  example,  charcoal),  the 
percentage  of  parahydrogen  in  the  equilibrium  mixture  increases, 
and  is  practically  100%  at  0°K.  Parahydrogen  produced  at  low 
temperatures  is  extremely  stable  and  can  be  preserved  in  such 
an  equilibrium  system  for  a  period  of  several  weeks  at  room 
temperature.  Orthohydrogen  has  not  yet  been  obtained  in  pure 
form.  The  difference  in  the  thermal  conductivities  at  low  tempera- 
tures (the  thermal  conductivity  of  parahydrogen  is  larger)  is  used 
for  determining  the  composition  of  the  mixture.  Similarly,  para- 
hydrogen and  orthohydrogen  have  somewhat  different  dissociation 
energies  and  optical  properties. 

F.    THE  VALENCE  THEORY 

We  shall  now  explain  the  concept  of  chemical  valence  in  terms 
of  quantum  mechanics.  By  chemical  valence  we  mean  the  ability 


As   an  exception   to  this  rule,  let  us  mention,   for  example,    the  C>2  molecule,   for  which 
2L is  the  mam  term 
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of  an  atom  to  combine  with  a  specific  number  of  other  atoms.  As 
already  mentioned,  the  first  success  of  quantum  theory  in  con- 
nection with  the  chemical  properties  of  atoms  was  the  explanation 
of  heteropolar  chemical  compounds  (Kossers  theory);  these  com- 
pounds are  formed  as  a  result  of  the  redistribution  of  electrons 
in  the  outer  shells  of  the  participating  atoms.  According  to  this 
theory,  the  numerical  value  of  the  valence  is  determined  by  the 
number  of  electrons  which  an  atom  gives  up  to  another  atom  (posi- 
tive ionic  valence)  or  acquires  from  another  atom  (negative  ionic 
valence).  In  the  formation  of  a  molecule,  the  electrons  in  the 
outer  shells  of  atoms  are  redistributed  so  that  the  valences  of 
the  atoms  are  saturated. 

Further  progress  in  the  investigation  of  the  formation  of  a 
molecule  was  made  with  the  He  itler- London  theory.  This  theory 
succeeded  in  explaining  the  formation  of  the  simplest  homopolar 
molecule  H3,  which  serves  as  the  basis  of  our  present  concept  of 
the  covalent  bond.  According  to  the  He  itler- London  theory,  the 
spins  of  the  valence  electrons  are  mutually  compensated  in  the 
homopolar  hydrogen  molecule.  Generalizing  these  results,  it  is 
possible  to  conclude  that  the  formation  of  homopolar  molecules 
occurs  under  the  condition  of  mutual  compensation  of  the  spins 
of  the  valence  electrons.  Accordingly,  this  type  of  valence  is 
also  sometimes  called  the  spin  valence. 

Since  the  saturation  of  valence  bonds  amounts  to  a  compensation 
of  the  spins  of  the  valence  electrons,  the  chemical  valence  of  atoms 
is  given  by  the  number  of  electrons  with  an  uncompensated  spin 
present  in  the  outer  shell. 

To  illustrate  these  general  principles,  let  us  consider  some 
specific  examples.  Figure  27. 9  gives  the  ground-state  configurations 
of  several  elements  of  the  periodic  system.  The  electron  states 
are  shown  as  boxes,  while  the  electrons  are  denoted  by  arrows 
whose  directions  correspond  to  their  spin  orientations.  It  is  clear 
from  this  figure  that  the  configuration  of  the  outer  shell  of  the 
hydrogen  atom  (ls')25  corresponds  to  a  single  valence  bond.  The 
valence  of  hydrogen,  which  is  equal  to  one,  is  smaller  by  a  factor 
of  one  than  the  multiplicity  of  its  terms,  which  is  two  (the  multi- 
plicity is  designated  by  the  superscript  on  the  left-hand  side  of 
the  term  symbol  S). 

Similarly,  the  ground  state  of  the  helium  atom  has  the  con- 
figuration (2s'2)  It  is  evident  that  the  multiplicity  equals  one  ('S), 
while  the  valence  is  equal  to  zero. 

The  boron  atom  (Z  =  5)  has  the  ground  state  (ls'2^2/?1),  corre- 
sponding to  the  doublet  (2P),  and  consequently  the  valence  equals 
one.  The  excited  state  (Is-'2sl2^)»  corresponding  to  the  quartet 
(4P)  is  also  possible;  in  this  state  the  valence  of  boron  is  equal  to 
3.  Thus  the  presence  of  several  different  valences  in  the  elements 
of  various  groups  in  the  periodic  system  can  be  explained  in 
comparatively  simple  form  (see  Table  27.2). 
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Although  according  to  experiment  the  elements  of  oxygen  and 
halogen  groups  can  have  several  different  valences,  the  O  and 
F  atoms  themselves  show  only  the  principal  valence.  This  is 
due  to  the  fact  that  their  multiplicity  can  be  increased  only  if 
an  electron  is  transferred  to  a  shell  with  a  larger  value  of  the 
principal  quantum  number.  This  process  is  unfavorable  from 
the  energy  standpoint  (the  d  subshell  is  absent  in  O  and  F).  On 
the  contrary,  for  other  elements  of  these  groups  there  is  a  pos- 
sibility of  transition  between  states  of  the  same  shell,  but  having 
different  values  of  / 
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Fig  27  9.  Diagram  showing  the  filling 
of  the  electron  shells  of  several  atoms 
with  the  spin  taken  into  account  Homo- 
polar  valence  of  atoms  is  denoted  by  a 
dot,  and  ionic  valence  is  denoted  by 
a  \  (positive)  or  —  (negative)  sign 


Table  27.2 
Multiplicity  and  homopolar  valence 


Group  of  the 

periodic  sys- 

1 

II 

III 

IV 

V 

VI 

Vli 

tem 

Multiplicity 
Valence* 

2 
1 

1,3 
0,2 

2,4 
1,3 

1,  3,  5 
0,  2,  4 

2,  4,  6 
1,  3>  5 

],  3,  5,  7 
0,  2,  4}  6 

2,  4,  6,  8 
1,  3,  5,  7 

The  bold-face  type  indicates  the  principal  valence. 
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According  to  the  configuration  given  in  Fig.  27.9,  nitrogen  in 
the  ground  state  (Is22s*2p')  is  trivalent  (the  three  electrons  in  the 
2p  shell  have  parallel  spins).  However,  it  can  also  be  univalent 
(antiparallel  spins  of  the  two  electrons  in  the  2p  subshell)  and 
even  pentavalent  (Is*2sl2p3)  (the  four  spin  valences,  associated 
with  the  parallel  spins  of  the  electrons  in  the  2s  and  2p  subshells, 
are  augmented  by  a  fifth  ionic  valence  associated  with  the  removal 
of  a  second  electron  from  the  2s  subshell).  In  this  connection  we 
note  that  the  ionic  valence  of  oxygen  and  fluorine  is  the  same  as 
the  spin  valence  (covalence). 

It  should  be  emphasized  that,  in  general,  it  is  impossible  to 
divide  rigorously  the  chemical  bonds  into  homopolar  and  hetero- 
polar  ones.  The  two  types  of  bonds  correspond  to  the  limiting 
cases  of  the  electron  density  distribution  in  the  incomplete  shells. 
The  limiting  case  of  asymmetry  in  the  distribution  of  the  electron 
density  between  the  atoms  corresponds  to  a  heteropolar  molecule. 
Such  a  molecule  has  a  dipole  moment  and  can  be  regarded  as  an 
ionic  structure.  The  case  of  a  homopolar  bond  corresponds  to 
identical  electron  density  distributions  in  the  atoms  of  the  mole- 
cule. A  homopolar  molecule  has  no  dipole  moment,  and  it  can  be 
considered  as  a  structure  formed  from  two  neutral  atoms. 

Quantum  theory  provides  a  general  method  for  the  explanation 
of  valence  forces  and  deals  with  both  types  of  bonds  (homopolar 
and  heteropolar)  in  a  single  scheme.  One  of  the  chief  merits  of 
the  Heitler- London  quantum- mechanical  theory  of  the  II2  molecule 
is  that  it  explains  the  saturation  of  homopolar  bonds  in  terms  of 
the  saturation  of  the  spins  of  the  electron  shells  when  electrons 
combine  into  pairs  with  antiparallel  spins.  When  a  hydrogen  atom 
approaches  a  H2  molecule  no  additional  pairs  with  compensated 
spins  are  formed,  and  hence  there  is  no  gain  in  energy.  The  H., 
molecule  therefore  cannot  exist. 

It  must  be  emphasized,  however,  that  the  Heitler- London  theory 
was  developed  only  for  molecules  consisting  of  atoms  in  the  s 
state,  and  therefore  an  extension  of  its  conclusions  to  more  com- 
plex atoms  must  be  of  a  somewhat  qualitative  nature.  Further 
development  of  the  theory  of  homopolar  bonds  has  shown  that  in 
the  case  of  complex  atoms  quantum  laws  alone  are  not  sufficient. 
In  this  case  we  must  also  include  the  influence  of  the  specific 
properties  characteristic  of  the  chemical  compounds. 


G.    MASERS  AND  LASERS 

The  electromagnetic  waves  radiated  by  conventional  radio  transmitters  have  a  com- 
paratively wide  frequency  band.  Everyone  who  has  used  a  radio  receiver  knows  that 
transmitting  stations  which  have  nearly  equal  frequencies  overlap  one  another.  This  is 
due  to  the  fact  that  conventional  transmitters  have  insufficient  stability  and  often  "drift" 
into  another  frequency  band. 
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Thus,  the  stability  of  even  the  best  existing  quartz  oscillators  is  inadequate  in  a 
number  of  cases  (for  example,  high-stability  oscillators  are  necessary  for  accurate 
determination  of  distances  by  means  of  radar).  For  this  reason,  one  of  the  great  achieve- 
ments of  recent  times  was  the  development  of  masersQby  Townes  and  coworkers,  and 
independently  by  Basov  and  Prokhorov.  In  this  device,  quantum  transitions  between 
discrete  energy  levels  in  a  molecule  are  used  as  microwave  generators. 

We  note  first  of  all  that  the  process  of  spontaneous  emission,  which  depends  on  the 
Einstein  coefficient  A,  has  no  essential  significance  in  the  case  of  molecular  emission, 
since  its  intensity  is  proportional  to  w4  and  is  extremely  small  in  the  radiofrequency 
region  (that  is,  in  the  range  of  frequencies  that  are  low  compared  with  light  frequencies). 

As  far  as  induced  transitions,  which  are  proportional  to  the  Einstein  coefficient  B,  are 
concerned,  the  probability  of  upward  transitions  (resonance  absorption) and  the  probability 
of  downward  transitions  (induced  emission)  are  identical  (see  Chapter  9);  they  are  pro- 
portional to  the  energy  of  the  field.  Therefore,  a  system  (molecule)  with  two  levels  will 
undergo  a  transition  from  one  level  to  another  under  the  influence  of  a  sufficiently  strong 
external  field  containing  the  resonant  frequency,  the  transitions  being  accompanied  by  the 
emission  and  absorption  of  quanta. 


Kig.  27.10.  Structure  of  the  ammonia  molecule 

showing  two  mirror-symmetric  states   a  and  b 

of  the  same  energy. 

If  external  radiation  is  passed  through  a  substance,  it  will  interact  with  its  molecules 
and  cause  resonant  absorption  as  well  as  induced  emission.  In  accordance  with  the 
Boltzmann  distribution  (the  number  of  particles  in  an  equilibrium  system  with  energy  E 

is  proportional  to  e~K/kT)t  there  must  be  fewer  particles  in  the  higher  energy  states 
than  in  the  lower  states;  therefore,  absorption  dominates  induced  emission  in  the  thermal 
equilibrium.  The  excess  of  absorbed  energy  is  completely  converted  into  the  energy  of 
thermal  motion  of  the  molecules,  raising  the  temperature  of  the  gas. 

In  order  for  the  system  to  amplify  and  not  absorb  the  radiation  incident  on  It,  it  is 
necessary  to  disturb  this  thermal  equilibrium  in  such  a  manner  as  to  produce  greater 
occupancy  of  the  higher  energy  levels  than  of  the  lower  ones. 10  When  this  is  done, 
such  a  system  will  generate  electromagnetic  waves  with  extremely  small  line  width 
under  the  action  of  resonant  radiation.  The  first  masers  were  produced  with  ammonia 
(NH3)  molecules.  The  ammonia  molecule  consists  of  one  nitrogen  atom  and  three 
hydrogen  atoms  and  forms  a  right  pyramid.  However,  from  the  laws  of  symmetry,  the 
stable  state  of  the  ammonia  molecule  corresponds  not  only  to  the  case  in  which  the 
nitrogen  atom  is  located  above  the  triangle  composed  of  the  hydrogen  atoms  (Fig.  27.10a), 
but  also  to  the  case  In  which  the  nitrogen  atom  is  located  at  the  same  distance  under  the 
triangle  (Fig.  27.10bX  The  stable  states  a  and  £have  equal  energies  and  correspond  to  the 


The  term  "masor"  is  an  acronym  formed  from  the  first  letters  of  microwave  amplifi- 
cation by  stimulated  emission  of  radiation. 

Such  a  state  can  also  be  described  by  the  Boltzmann  distribution  function,  if  it  is 
assumed  that  the  temperature  T,  which  in  this  case  plays  the  role  of  a  parameter,  assumes 
negative  values 
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minima  of  the  potential  energy;  they  are  thus  separated  by  a  potential  barrier,  so  that 
classical  theory  does  not  allow  transitions  from  one  to  the  other.  In  quantum  theory, 
however,  the  probability  of  penetration  across  the  potential  barrier  is  different  from 
zero.  Consequently,  the  nitrogen  atom,  which  is  above  the  triangle,  may  reposition  it- 
self below  the  triangle  in  the  absence  of  any  external  action,  and  may  then  return  to  its 
original  position.  In  molecular  physics  this  phenomenon  is  called  inversion.  The  process 
of  inversion  can  be  explained  in  terms  of  the  existence  of  two  perfectly  identical  potential 
wells,  separated  by  a  potential  barrier  of  finite  width.  It  is  well  known  that  this  leads  to 
a  splitting  of  the  spectral  lines,  in  spite  of  the  fact  that  the  energy  states  in  the  two  po- 
tential wells  are  completely  identical  to  one  another.  For  the  ammonia  molecule  this 
splitting,  expressed  in  wavelengths,  is  1.27  cm  for  the  most  intense  line;  this  corre- 
sponds to  a  radiofrequency  wave. 

Another  remarkable  property  of  the  ammonia  molecule  is  that,  with  the  help  of  an 
electric  field,  it  is  fairly  easy  to  separate  the  molecules  in  the  upper  and  lower  energy 
levels  that  have  been  formed  as  a  result  of  inversion.  It  is  found  that  when  an  ammonia 
molecule  is  placed  in  an  electric  field,  the  two  levels  are  shifted  in  different  directions, 
the  upper  inversion  level  being  shifted  upwards  (the  energy  increases)  and  the  lower 
inversion  level  downwards. 

Since  any  system  tends  to  a  state  in  which  its  potential  energy  is  minimum,  we  first 
pass  a  beam  of  ammonia  molecules  through  a  carefully  evacuated  vessel  containing  an 
electrostatic  field  which  is  produced  by  a  quadrupole  capacitor;  this  field  decreases 
towards  the  symmetry  axis  (see  Fig.  27.1  IX 

In  this  separating  system  the  molecules  in  the  upper  level  will  tend  to  move  to  a 
region  where  this  field  is  minimum  (since  the  field  increases  their  energy);  that  is, 
they  will  be  focused  about  the  axis  of  the  capacitor.  Ilie  molecules  that  are  in  the  lower 
level,  however,  tend  to  move  to  the  region  of  maximum  field  (that  is,  to  the  periphery, 
where  their  energy  will  be  minimum),  and  thus  they  will  be  ejected  from  the  beam.  After 
passing  through  the  above  separating  system,  the  beam  of  molecules,  which  contains  a 
predominance  of  excited  molecules,  enters  a  cavity  resonator  which,  among  its  rather 
wide  range  of  radio  frequencies  (produced  by  conventional  radio  frequency  methods)  also 
contains  the  resonant  frequency  of  the  molecular  transitions. 

This  external  radiation  causes  molecular  transitions  which  are  associated  with  a 
line  of  small  width  (^  10  sec-i)  ata frequency  of  2.4  x  1010  sec^i  (that  is,  with  a  relative 
error  no  greater  than  10-Q),  corresponding  to  the  wavelength  of  1.27  cm.  This  wavelength 
is  determined  mainly  by  the  time  of  traverse  of  the  molecules  through  the  cavity  resonator. 

A  molecular  generator  produced  in  such  a  way  exhibits  extremely  stable  frequency. 
The  frequency  stability  is  so  high  that  molecular  clocks  constructed  on  this  principle 
have  an  accuracy  of  approximately  1  sec  per  300  years  of  continuous  operation.  * l 

On  the  basis  of  this  principle,  several  types  of  radio  receivers  working  in  the  micro- 
wave region  have  been  constructed.  They  all  use  paramagnetic  crystals  cooled  to  very 
low  temperatures.  These  devices,  known  as  quantum  paramagnetic  amplifiers,  greatly 
increased  the  sensitivity  of  radio  astronomical  and  radar  equipment. 

Recently,  atoms  and  molecules  have  been  used  as  generators  and  receivers  in  the 
visible  spectrum.  In  this  case,  we  speak  of  lasers  (light  amplification  by  stimulated 
emission  of  radiation). 

What  is  the  difference  between  an  ordinary  source  of  light  and  a  laser?  In  an  ordi- 
nary source  of  radiation,  for  example,  the  Sun,  the  spectrum  consists  of  a  broad  fre- 
quency band  and  the  individual  incoherent  quanta  have  arbitrary  phases  and  directions. 
Optical  lasers  enable  us  to  obtain  a  monochromatic  beam  of  light  of  high  intensity  (when 
it  is  focused  we  obtain  a  radiation  density  a  thousand  times  greater  than  that  obtainable 


The  stability  of  the  frequency  in  masers  made  it  possible  to  construct  two  standard 
independently  operating,  synchronized  clocks  These  clocks  were  used  to  measure  the 
velocity  of  light  in  one  direction  under  terrestrial  conditions  and  therefore  to  check  the 
fundamental  conclusions  of  the  special  theory  of  relatn'ty  regarding  first-order  effects 
(proportional  to  the  velocity  t>  —  i  /i). 

We  recall  that  all  interference  experiments  similar  to  the  Michclson  experiment  involve, 
in  effect,  only  one  standard  clock  and  are  used  to  measure  the  velocity  of  light  over  a 
closed  path  Thus,  only  second-order  effects  (proportional  to  /32)  could  be  detected  in 
these  experiments 
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by  focusing  sunlight),  great  coherence  and  extreme  sharpness  (with  telescopic  apparatus 
a  beam  of  light  reaching  the  Moon  would  have  a  diameter  of  the  order  of  3  km).  The 
first  laser  produced  by  Maiman  was  a  three-level  ruby  laser.  Chromium  atoms,  which 
are  present  in  ruby  (aluminum  oxide)  as  a  slight  impurity,  absorb  light  from  a  power 
klystron  over  a  wide  band  in  the  green  and  yellow  regions.  Initially,  these  atoms  give 
up  (without  emitting  radiation)  part  of  their  energy  to  the  crystal  lattice  and  make  a 
transition  to  a  metastable  state  from  which  dipole  transitions  are  forbidden,  and  hence 
they  can  be  maintained  in  that  state  for  a  comparatively  long  time  (of  the  order  of 
several  milliseconds). 
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Fig      27.11      Electrostatic    field    of    the 
capacitor  in  the  separating  system 


At  a  "negative"  temperature  (when  the  number  of  metastable  atoms  is  greater  than 
the  number  of  unexcited  atoms  and  tinder  the  action  of  stimulated  emission,  such  a 
system  is  capable  of  generating  almost  instantaneously  monochromatic  waves  with  a 
wavelength  of  6943  A  (red  regionX  'ITiese  transitions  are  induced  by  the  first  emitted 
quanta,  which  are  retained  in  the  ruby  (the  ruby  has  the  shape  of  a  circular  rod,  bounded 
by  silvered  parallel  ends,  one  of  which  is  semitransparentjL  Photons  moving  parallel  to 
the  axis  induce  the  emission  of  photons  having  the  same  frequency  and  moving  in  the 
same  direction.  Ihis  chain  process  intensifies  until  a  coherent  ray  finally  passes  through 
the  semitransparent  mirror  (after  multiple  reflection  from  the  ends). 

The  prospects  for  practical  application  of  lasers  are  enormous.  These  devices  can 
be  used  for  the  simultaneous  transmission  of  various  types  of  information,  the  establish- 
ment of  cosmic  communication,  the  control  of  chemical  reactions  induced  by  thermal 
excitation  and  so  forth. 
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Some  Applications  to  Nuclear  Physics 


Chapter  28 


Elastic  Scattering  of  Particles 


A.    TIME-DEPENDENT  PERTURBATION  THEORY 

In  the  case  when  the  Hamiltonian  is  an  explicit  function  of  time, 
it  is  not,  as  a  rule,  possible  to  obtain  an  exact  solution  of  the 
Schr'odinger  equation.  If  the  time-dependent  part  of  the  Hamiltonian 
V  (/)  is  small  in  comparison  with  the  time- independent  part  of  the 
Hamiltonian  H°,  time-dependent  perturbation  theory  can  be  used 
to  solve  the  problem. 

It  is  obvious  that  time-dependent  perturbation  theory  can  also 
be  applied  when  the  perturbation  energy  is  independent  of  time.  In 
this  case  we  obtain  the  same  solutions  as  in  the  stationary  per- 
turbation theory  of  Chapter  14, 

Including  the  perturbation  V'(t),  we  write  the  Schr'odinger  equa- 
tion as 

-f-T-  =  lH§  +  V'W]*(0.  (28.1) 

The  solution  of  this  equation  in  the  zeroth  approximation  (V  (t)  =  Q) 
is 

a-'V/^|f,  (28.2) 


where  the  wave  function  tyn  is  a  solution  of  the  unperturbed  time- 
independent  wave  equation 

E^  =  H^n9  (28.3) 

and  satisfies  the  orthonormality  conditions 

'M^W  <28-4) 

We  shall  look  for  a  solution  of  Eq.  (28,1)  in  the  form  of  Eq. 
(28.2),  assuming  that  the  coefficients  Cn  depend  on  time:  Cn-+Cn(t). 
This  approximate  method  of  solution  was  proposed  by  Dirac  and  Is 
known  as  the  method  of  variation  of  constants. 
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Substituting  Eq.  (28.2)  into  Eq.  (28.1)  and  using  Eq.  (28.3),  we 
obtain  the  following  equation  for  the  unknown  coefficients  Cn: 


-ie«* 


(28.5) 


l-E.t 

Multiplying  Eq.  (28.5)  by  tyl*e*  '"  ,  integrating  the  resulting 
expression  over  all  space,  and  using  the  orthonormality  condition 
(28.4),  we  obtain  a  system  of  equations  for  the  unknown  coefficients 

Cn' 

-T  Cn>=2Cn»e>«»n>n>'VM>  (t),  (28.6) 

n" 

where 

^..=Ejq^..  (28.7) 

and  the  matrix  element 


Dirac's  system  of  equations  (28.6),  taken  for  all  values  of  n'  9  is 
completely  equivalent  to  the  original  wave  equation. 

The  approximation  used  in  perturbation  theory  consists  in  ex- 
panding the  solution  in  the  form 

c».=ci.+c;>+  Q+....  (28.8) 

where  the  zero-order  coefficients  CJ*  are  independent  of  V.  The 
coefficients  for  the  first  and  higher  orders  will  be  proportional 
to  V,  V'\  and  so  forth. 

Substituting  Eq.  (28.8)  into  Eq.  (28.6)  and  retaining  only  the 
first-order  terms,  we  find  the  following  system  of  equations  for 
the  coefficients  Cn>: 

C°n,  =  0  (zeroth  approximation)  , 

(28.9) 

*"6U°n'n"v  *'*"(*)  (first  approximation)  and  so  on. 

The  first  of  the  equations  (28.9)  shows  that  the  unknown  zero- 
order  coefficients  must  be  independent  of  time,  that  is, 

CJ.=  const.  (28.10) 
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Their  values  are  given  by  the  initial  conditions  and  characterize 
the  state  of  the  electron  before  the  perturbation  is  applied. 

Let  us  assume  that  at  £  =  /0the  electron  is  in  the  state  n.  Then 
we  may  set 

C°,  =§„„„.  (28.11) 

This    expression   specifies   the  initial  conditions  of  the  problem. 
Substituting  Eq.  (28.11)  into  Eq.  (28,9),  we  find 


.('). 


(28.12) 


In  quantum  mechanics  one  generally  calculates  the  transition 
probability  w  per  unit  time.  Since  the  probability  of  finding  a 
particle  in  the  state  n'  is  given  by  |  Cn*  |*f  we  obtain  the  following 
expression  for  the  transition  probability: 


(28.13) 


Equations  (28.12)  and  (28.13)  are  the  basis  for  the  investigation 
of  many  quantum-mechanical  problems  in  the  first-order  time- 
dependent  perturbation  theory. 


B.  THE  CROSS  SECTION  FOR  ELASTIC  SCATTERING 

We  shall  now  apply  the  results  obtained  above  to  the  study  of 
the  elastic  scattering  of  an  electron. 

We  assume  that  at  the  initial  time  /o  =  0  the  particle  is  free; 
that  is,  it  moves  uniformly  with  a  momentum  p  =  Uk  and  an  energy 

*_*_        A  «o?\ 


At  the  instant  of  time  /0  — 0, 
the  particle  comes  within  the 
range  of  interactions;  that  is, 
it  moves  in  a  potential  V  (/•). 
The  particle  now  has  a  finite 
probability  of  making  a  tran- 
sition to  the  state  with  the 
momentum  p'  =  nkr  and  energy 

E=  cHK'  (/C'  =  ^);      that      is, 

the  particle  is  scattered  as 
a  result  of  interaction  (Fig. 
28.1). 


Fig.  28.1.  Scattering  of  a  particle  by  a  cen- 
ter of  force-  nk  is  the  momentum  of  the 
incident  particle,  fik  is  the  momentum  of 
the  scattered  particle,  &  is  the  angle  of 
scattering,  0  denotes  the  scattering  center 
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The  wave  functions  of  the  initial  and  final  states,  describing  the 
free  motion  (in  the  zeroth  approximation),  are  [see  (4.62)] 

v     *     ' 

where  L  is  the  period,  and  the  momentum  components  kt  and 
k't  (*=1,  2,  3)  are  related  to  the  integers  nt  and  n'i  by  means  of 
the  expressions 

*,  =  £'.     «  =  ?£.  (28.15) 

Substituting  the  wave  functions  (28.14)  into  Eq.  (28.12),  we  find  for 
the  coefficient  CV 


where  the  matrix  element  of  the  perturbation  energy  is  [(V"  =  V  (r)] 

V*  =  ^  e™V  (r)  d']x,  *  =  k  —  #. 
Hence,  the  transition  probability  is 


We  note  that  the  function 

f/0  /9R  1R. 

—  (28.18) 


has  a  sharp  maximum  for  sufficiently  large  values  of  t  and 
/C'—  •  /C->0.  This  means  that  in  practice  we  can  restrict  ourselves 
to  only  those  changes  of  K'  for  which  the  following  condition  is 
satisfied: 


Since  the  quantity  t  =  t  —  tQ  =  bt  is  the  time  elapsed  from  the 
initial  instant,  and  the  quantity  cft(Kf~K)=&E  is  the  energy 
spread  resulting  from  scattering,  we  find  a  relationship  between 
these  quantities 

A.  (28.19) 
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This  relation  can  be  considered  as  a  fourth  uncertainty  relation; 
it  is  usually  obtained  from  the  theory  of  transition  processes. 

The  uncertainty  in  the  energy  is  characteristic  for  any  wave 
process;  its  optical  analog  is  the  familiar  expression  for  the 
broadening  of  spectral  lines,  resulting  from  the  finite  duration  of 
the  emission. 

For  sufficiently  large  values  of  t(t->oo)  the  uncertainty  in 
energy  tends  to  zero  and  Eq.  (28.18)  becomes  a  statement  of  the 
law  of  conservation  of  energy 


This  explains  why  this  type  of  scattering  is  said  to  be  "elastic."  l 
Mathematically,  this  follows  from  the  fact  that  the  function  (28.18) 
is  a  8  function  at  t  =  oo>  Integration  of  the  &  function  leads  to  the 
replacement  of  Kf  by  /(.  To  show  this,  let  us  consider  the  integral 


/  _  J_  f  J^K^-KL  F  (K')  dK'.  (28.  20) 

71  a'       *  ~K 


Introducing  a  new  variable 

rf(/C  -/()=?, 
we  get 


-ctK 

If  the  function  F  has  no  singularities,  we  obtain  as  /->oo 

+  00 


On  the  other  hand,  from  the  definition  of  the  B  function  it  follows 
that 


^  &(/£'  —  K)F(Kr)dK'  =  F(K).  (28.21) 


An  example  of  inelastic  scattering  is  provided  by  bremsstrahlung— a  scattering  process 
in  which  an  electron  emits  a  photon,  so  that  K   ^  K. 
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Hence  it  is  clear  that  expression  (28.18)  becomes  a  8  function  as 
t->oo,  and,  consequently,  Eq.  (28.17)  for  the  transition  probability 
can  be  written  as 


).  (28.22) 

Replacing  the  summation  (28.22)  by  integration  we  use,  in  accord- 
ance with  Eq.  (28.15),  the  following  relation: 

|  J8  =  k'"  dVdQ  =  k«k'dK'dQ.  (28.23) 

The  scattering  process  is  usually  characterized  by  a  cross  sec- 
tion, which  is  equal  to  the  ratio  of  the  probability  w  to  the  number  of 
particles  N  incident  per  unit  time  on  a  unit  surface  S  perpendicular 
to  the  incident  beam.  Obviously  the  particles  that  strike  this  sur- 
face per  unit  time  are  those  located  at  a  distance  not  exceeding 
the  velocity  of  the  particles  v9  that  is,  the  particles  contained  in 
the  volume  vS  =  v.  The  number  A/  is  equal  to  the  number  of  parti- 
cles per  unit  volume  p0  =  £  3  multiplied  by  a  volume  which  is 
numerically  equal  to  the  velocity  of  the  particle 

"=£=£•£•  <28-24> 

With  the  aid  of  Eqs.  (28.22)  -  (28.24),  we  find  the  following 
expression  for  the  scattering  cross  section: 

(28.25) 

The  integrand  characterizing  the  number  of  scattered  particles 
incident  per  solid  angle  d$(d£  =  smbdbdyt  where  &  and  9  are  the 
spherical  scattering  angles),  known  as  the  differential  cross  sec- 
tion, is  equal  to 

'  <28-26> 


In  particular,  when  the  scattering  center  is  spherically  symmetric, 
we  have 


where  dQ'  is  the  solid  angle  associated  with  the  vector  r,  whereas 
in  Eq.  (28.25)  dQ  is  the  solid  angle  associated  with  the  vector  k'. 
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Integrating  the    last   expression   over   the   solid  angle  dQ'9  we 
find 

l/x=4*      rs[n*rV(r)dr. 


From   this    it   is    clear   that   the   differential  cross  section  of  the 
elastic  scattering  is  equal  to 


(28.27) 
where 

x=  |A  —  kr\  =  2ksin~1  (28.27a) 

and  the  quantity 

(28.28) 


is  called  the  scattering  amplitude. 

Equation  (28.27),  describing  the  elastic  scattering  of  the  par- 
ticles by  a  center  of  force  V  (r)  in  first-order  perturbation  theory, 
was  originally  developed  by  Born;  it  is  therefore  called  the  Born 
approximation. 

We  note  that  this  problem  can  be  also  solved  in  the  time- 
independent  perturbation  theory,  since  the  potential  energy  of  in- 
teraction is  time-  independent.  To  obtain  the  scattering  cross 
section  we  have  used,  however,  the  time-dependent  perturbation 
theory,  the  mathematical  apparatus  of  which  is  comparatively 
simple  but  is  much  more  general.  In  particular,  it  can  be  used  to 
solve  many  problems  in  modern  quantum  electrodynamics,  taking 
account  of  the  interaction  of  electrons  with  the  doubly  quantized 
electromagnetic  field  (see  Chapter  29). 

The  expression  for  o(0),  obtained  from  the  perturbation  theory, 
is  applicable  only  within  some  definite  limits.  In  the  case  of  short- 
range  forces  (nuclear  forces,  neutral  atom,  impenetrable  spheres, 
and  so  forth),  which  can  be  neglected  for  r  >  /?,  where  R  is  a  cer- 
tain effective  radius,  the  magnitude  of  the  cross  section  must  be 
either  less  or  of  the  same  order  as  their  geometric  cross  section 
rc/?2  (even  if  these  forces  create  an  absolutely  impenetrable  barrier). 
For  short-range  forces,  therefore,  the  range  of  applicability  of 
the  perturbation  method  is  given  by 

o<>/?2.  (28.29) 

Equation  (28.  27)  is  not  applicable  to  long-range  forces  (Coulomb 
forces)  if  the  scattering  angle  is  small.  This  question  requires 
a  more  detailed  analysis  (see  below). 
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C.  SCATTERING  BY  THE  YUKAWA  CENTER  OF  FORCE 

The  form  of  the  potential  energy  for  the  Yukawa  Interaction  is 
as  follows: 

V=-Ae~**r.  (28.30) 


where  A   is  a  constant  related  to  the  charge  and  /?==]--  is  the  effec- 

KO 

tive  range  of  these  forces.  The  interaction  (28.30)  has  some  im- 
portant applications  in  the  theory  of  nuclear  forces  and,  in  partic- 
ular, in  meson  theory. 

For  nuclear  forces,  the  quantity  A  is  equal  to  g*  ,  where  gl 
determines  the  strength  of  the  potential.  The  range  of  nuclear 
forces  is  equal  to  the  Compton  wavelength  of  a  pion 

/?  =  -*    —10  13cm.  (28.31) 

»«„<•  v          ' 

In  the  case  of  fast-electron  scattering  (or  a  -particle  scatter- 
ing) by  neutral  atoms,  the  interaction  potential  given  by  the  Thomas- 
Fermi  model  can  be  approximated  by  the  expression  (28.  30).2  In 
this  case  the  quantity  A  =  Zel,  where  Z  is  the  atomic  number,  and 
the  effective  radius  R  of  the  atom  in  the  Thomas-  Fermi  model  is 
equal  to 

R  =  -fh*  (28.32) 


where  f  is  a  numerical  factor  of  the  order  of  unity. 

Finally,  setting  R—  *oo,  we  obtain  the  Coulomb  potential  of  the 
nuclear  field  which,  consequently,  can  be  considered  as  a  limiting 
case  of  Eq,  (28.30). 

Substituting  Eq.  (28.30)  into  (28.27)  and  using  the  relation 


we  obtain  the  following  expression  for  the  differential  cross  section 
of  elastic  scattering: 

(28.33) 


As  has  been  mentioned  in  Chapters  25  and  27,  a  more  accurate  approximation  of  the 
Thomas-Fermi  potential  is  given  by  the  expression  (25  22)  The  results  of  the  two  approx- 
imations, however,  do  not  differ  greatly  from  one  another  (which  is  a  consequence  of  the 
short-range  character  of  the  forces).  The  approximation  (28  30)  is  more  convenient  in 
calculations  of  scattering. 
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Here,  according  to  Eq.  (28.27a), 

n  5  n 

x*  =  4#2sin2  -£-  =  4  ^,-  s  i  n*  y  , 

where  p  is  the  momentum  of  the  particle. 

Two  cases  should  be  distinguished  in  the  analysis  of  Eq.  (28.33). 

1.  Scattering  of  relatively  slow  particles,  when  */?<*!  for  all 
scattering  angles.    In  this  case  Eq.  (28.33)  shows  that  a  (ft)  is  inde- 
pendent of  the  angle  ft  and  becomes  equal  to 

(28.34) 

The  independence  of  the  scattering  cross  section  of  the  angle  ft 
(isotropy)  is  a  characteristic  feature  of  scattering  of  relatively 
low-  energy  particles  by  a  center  of  short-range  forces. 

2.  In  the  case  of  relatively  fast-particle  scattering,  the  differ- 
ential cross  section  becomes  independent  of  R  (the  effective  range 
of  force)  for  all  scattering  angles  satisfying  the  condition  x/?  V  1. 
Equation  (28.33)  in  this  case  reduces  to 

(28.35) 

It  is  evident  that  for  angles  satisfying  this  condition  the  scattering 
by  the  Yukawa  potential  is  the  same  as  the  scattering  by  a  Coulomb 
center.  Therefore,  in  the  case  of  fast  electrons  or  *-  particle 
scattering  by  a  neutral  atom  through  comparatively  large  angles, 
the  influence  of  atomic  electrons  is  not  important  and  the  scatter- 
ing is  determined  only  by  the  potential  of  the  nucleus. 

Setting  A=Ze]  and   x  =  -^-sin-|    in   Eq.  (28.33),  we  obtain  the 
familiar  Rutherford  equation 

x        Z*eM 

(28.36) 

x  ' 


which  was  obtained  by  the  classical  method  in  Chapter  2.  Equation 
(28.36)  shows  that  for  long-range  forces  there  is  a  strong  de- 
pendence of  the  scattering  cross  section  on  the  scattering 
angle  ft  . 

However,   for   any  large  values  of  the  wave  vectors  k  =  £  we 

can  always  find  small  angles  ft  such  that  the  following  inequality 
is  true: 

-sin<l.  (28.37) 
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In  particular,  as  ft— »0,  Rutherford's  formula  gives  a  divergent 
value  for  <*(&);  in  this  case  we  must  take  into  account  the  short- 
range  character  of  the  forces  resulting  from  the  screening  action 
of  the  electron  shell.  The  condition  (28.37)  now  determines  the 
region  of  inapplicability  of  the  Rutherford  formula. 

For  0  =  0,  that  is,  for  forward  scattering  (x==0),  we  find  from 
Eqs.  (28.32)  and  (28.33)  the  following  expression  for  the  differential 
cross  section: 

a  (0)  =  4TX^s/«  —  flJZ'/i.  (28.38) 

d-t-O 

The  total  cross  section  can  be  obtained  from  Eq.  (28.33) 


sinfo/fc 


[1+2W(1  —  costyf  ~ 
=   ™™W  L_  ,2839v 

ft*  4^8D8     j.   J    •  \4OoOiJ) 

Finally,  with  the  aid  of  Eq.  (28.29),  we  can  find  the  range  of 
applicability  of  the  perturbation  method  for  our  problem  in  the  two 
limiting  cases  considered  above 


<28*40) 


Represent  the  cross  section  for  scattering  of  particles  by  a  spherically 
s  y  mtliu  11  ic  |JU  Umti  al  as  a  sum  of  partial  cross  sections  (the  sum  of  the  cross  sections 
for  waves  with  a  well-defined  value  of  the  orbital  quantum  number  /  ).  Obtain  the  scatter- 
ing cross  section  for  the  general  case  and  for  the  Born  approximation, 

Solution.  Suppose  the  incident  particle  has  a  momentum  p  =  nk  and  velocity 
v=zftk/m0,  directed  along  the  2  axis. 

The  wave  function  of  the  incident  particle  is  of  the  form  of  a  plane  wave 


A  plane  wave  can  be  expanded  in  spherical  harmonics  (see  Problem  21.5).  Then,  using 
the  asymptotic  expression  for  the  Bessel  function  as  r  —  *oo 


'    .W-l/l 

/-|-  Y          7U 

the  incident  wave  can  be  represented  as 


/~o-   sm  (kr  — 

=      ~  —  \ 


V        *(*--?•) 

'=    -7."«'-i-n L_ !/_P/(cos8).  (28.41) 
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In  the  presence  of  the  potential  energy  I/ (r),  the  asymptotic  expression  for  the  wave  func- 
tion of  the  particle  in  a  centrally  symmetric  field,  in  accordance  with  Eqs.  (13.75)  and 
(13.78),  should  be  chosen  in  the  form 

oo 

n  s 

C/P/(cos8)  - 


where  the  phase  shift  &z  can  be  determined  from  the  asymptotic  solution  of  the  Schrodinger 
equation  for  the  radial  function  in  the  presence  of  the  potential  V  (r) 


Clearly  the  scattered  wave  is 


X 


01 -< 

The  unknown  coefficients  C/  can  be  determined  from  the  condition  that  the  f unction  fys 

must    be    a    diverging    spherical    wave.     Thus   the  coefficient  of  the  converging  wave" 

£  2  '  must  be  equal  to  zero.  ITien 

d;  = 

Tscat         , 

The   function   /(&)  is  the  scattering  amplitude  [see  (28.28)],  which,  according  to  the 
exact  theory,  is  equal  to 

00 

~  (28.43) 


The  differential  cross  section  characterizing  the  scattering  of  particles  through  an 
angle  §  is  equal  to  the  ratio  of  the  probability  of  scattered  particles  passing  per  unit 
time  through  an  element  of  the  spherical  surface  dS  = 


to  the  number  of  particles  incident  per  unit  time  on  a  unit  surface  perpendicular  to  their 
velocity,  that  is,  perpendicular  to  the  z  axis 


From  this  we  find  the  differential  cross  section 

*  sinfo/d.  (28.44) 

mc 

Here,    assuming  the  scattering  field  to  be  axially  symmetric,  we  set  the  solid  angle 
equal  to 

dQ  =  27c  sin  9dd. 
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Substituting  the  value  obtained  for  the  scattering  amplitude  and  using  the  orthonormahty 
property  of  Legendre  polynomials,  after  integrating  over  the  angles  we  get 

/McostyP^cosOJsini 


which  gives  us  the  following  expression  for  the  total  cross  section: 

oo 


(2M5) 

Z=0 

This  expression  for  a  is  the  desired  sum  of  partial  cross  sections. 

Let  us  compare  the  expression  for  the  scattering  amplitude  found  in  the  Born  approx- 
imation [see  (28.28)]  with  the  exact  expression  (28.43JL  The  comparison  shows  that  the 
exact  expression  gives  the  same  result  as  the  Born  approximation  for  small  values  of 
the  scattering  phase  angles  8/.  Indeed,  whenfy  <  1,  expression  (28.43)  becomes 

00 

<  (  cos  8)  '  (28-45a) 

Solving   Eq.    (28.42)  for  a  given  /    (for  a  partial  wave)  by  perturbation  methods,  we  can 
show  that 


i  (kr)dr*  (28.46) 

T 

o 
Next,  using  the  expansion 


where 

7  =  2k  sin  y  , 

the  scattering  amplitude  (28.45a)  can  be  reduced  to  the  form  (28.28)  found  in  the  Born 
approximation. 

^Problem  2 8f  ^Determine  the  cross  section  for  scattering  of  particles  by  a  spherically 
symmetric  potential  barrier  of  height  1/0>  0  and  radius  a 

V-fVo  for 
v  ~~  I  0     for 

when  the  radius  a  is  much  less  than  the  de  Broghe  wavelength  of  the  scattered  particles, 
that  is,  when  ka-'  1. 

Show  that  in  this  case  the  s  wave  (wave  with  /  —  0)  is  the  main  contributor  to  the 
scattering  process. 

Compare  the  exact  solutions  with  those  obtained  by  perturbation  methods. 

Solution.  It  may  be  seen  from  Eq.  (28.46)  that  at  ka  <  1,  the  s  wave  (/  =  0)  is  the  main 
contributor. 

Solving  the  problem  by  perturbation  methods  (Born  approximation),  that  is,  using 
Eq.  (28.28),  we  find  the  scattering  amplitude 
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Consequently,  the  cross  section  in  the  Born  approximation  is  equal  to 


(28.47) 


Let  us  determine  the  scattering  phase  shift  fy  in  this  simple  problem.  We  restrict 
ourselves  to  a  determination  of  the  phase  shift  for  /  =  0  and  low  energies  Et  when  we  can 
set  /?#<  1. 

When  /  =  0,  the  wave  equations  have  the  form 


=  0     forr>at 
XJ  —  *'*/.„  =  0    for  r<a, 
where 

X.  =  *.!>,    k*  =  ^E,    i'1^^.  -/•)  =  /•-*. 

Using  the  boundary  conditions  X0  (0)  =  0,  the  solution  of  Eq.  (28.48)  can  be  represented 
as 


_  I  A  sinhx'r        for  r  <  a, 
~  \  sin  (kr  +  80)   for 


Equating  the  wave  functions  and  their  derivatives  at  the  boundary  of  the  region  r  =  0§  we 
find  in  our  case  of  small  E  (E  <^  1/0) 


where 


0  =  arc  tan  \  J-  tanh  *'fl  ]  -ka^ka  (^^  -  1  )  ,  (28-49) 


Hence,  according  to  Eq.  (28.45),  we  have 

0  =  4na'(^-.)S.  (28.49a) 

In  the  case3 

*0  <  1  (28.50) 

we  can  set 


Then   Eq.   (28.49)  gives  an  expression  for  a    corresponding  to  the  Born  approximation 
[see  (28.47)]. 

Whenxa  >  1  (that  is,  whenl/0  -*  oo)  the  cross  section  reaches  its  maximum  value 

(28.51) 


We   note   that   the  condition   (28.50),    which   in   this   problem  can   also   be   written  as 

0B  <£  no2  , 
is  equivalent  to  the  condition  (28.29)  for  the  applicability  of  perturbation  method 
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The  last  expression  is  four  times  greater  than  the  corresponding  cross  section  for 
elastic  scattering  by  an  impenetrable  sphere,  calculated  according  to  classical  mechanics 
when  i  is  determined  simply  by  the  geometric  cross  section  of  the  sphere  *a\  This 
discrepancy  is  due  to  the  appearance  of  wave  properties  (diffraction)  of  the  scattered 
particles. 

— '*'  *X 

Problem  28.D   Determine  the  cross  section  for  scattering  of  slow  particles  (^a<  1) 

b^TSphCPWaTTysymmetric  potential  well: 


:1    on 


[  —  V0     for  r<a, 
"        for 


=  0    for  f 

=  0    for  r<a, 


Indicate  what  distinguishes  the  cross  section  for  scattering  of  particles  by  a  potential 
well  from  the  cross  section  for  scattering  by  a  potential  barrier. 

Solution.  In  the  exact  solution  of  the  problem,  Eq.  (28.48)  should  be  replaced  by 


where 


The  phase  shift  &0  is  given  by  the  following  expression  [instead  of  Eq.  (28.49)]: 

R0  =  arc  tan  (  A  tan  k'a\  —  ka  .  (28.54) 


When  £'0<!  1  and  £<  V0,  we  again  obtain  Eq.  (28.47),  which  was  derived  in  first- 
order  perturbation  theory.  In  this  case  the  cross  sections  for  scattering  by  a  potential 
well  and  by  a  potential  barrier  are  identical. 

The  difference  appears  when  the  quantity  x<2  fx2  =  ?/;^  l/0  j  becomes  comparable  to  or 
greater  than  unity.  Thus,  in  the  case  of  a  potential  barrier,  the  quantity  --  f  monoton- 
ically  approaches  zero  as  V9  increases,  whereas  in  the  case  of  a  potential  well  the 

vanes    periodically  over  the   range  from  0  to  oo  as 


corresponding    quantity 
I/o  increases. 

In  particular,  if  the  quantity  k'a  approaches  y ,  we  find  the  following  expression  for 

the  cross  section  in  the  region  <  ka : 

tan  k'a  ^ 


Because  of  the  smallness  of  the  quantity  k*a*=--  a3,  the  expression  for  a  is  much 

greater  than  the  maximum  value  for  the  cross  section  in  the  case  of  scattering  by  a 
potential  barrier  (^max  =  4*fl2  when  Vo— co  [see  (28.51)]).  Since  the  relation  k'a  = 

=  ]/"x2a3  +  ^8fl8  =  -j  for  ka  <  1  is  actually  equivalent  to  the  condition  of  appearance 
of  the  first  level  in  the  spherically  symmetric  potential  well,xa  ^  ^  9  the  cross  section 
(28.55)  corresponds  to  the  case  of  "resonant"  scattering.  Subsequent  resonance  maxima 

rt  f> 

of  the  cross  section  occur  when  *o  =  --.  «,  y  n  ,  and  so  forth. 

Note.  The  scattering  phase  shift,  together  with  the  expression  for  the  cross  section, 
can  be  accurately  determined  for  a  very  limited  class  of  problems  (scattering  by  a 
spherically  symmetric  potential  barrier  or  potential  well,  scattering  by  a  potential  in- 
versely proportional  to  the  square  of  the  distance,  and  so  forth).  However,  In  the  general 
case  we  must  use  approximate  methods.  For  intermediate  values  of  the  potential  energy  V0 
(for  instance,  in  the  case  of  scattering  of  charged  particles  by  a  Coulomb  field  the  constant 
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Z£* 

characterizing  the  potential  energy  must  satisfy  the  condition  -=—  <^  1)  perturbation 

nv 

theory  (that  is,  the  Born  approximation)  gives  good  results.  In  cases  when  the  Born 
approximation  is  no  longer  applicable,  the  phase  can  be  determined  by  other  approximate 
methods,  for  example,  the  WKB  method  [see  Eq.  (13.78)],  the  variational  method  and 
so  forth.  In  view  of  the  special  nature  of  these  methods  we  shall  not  discuss  them  in  this 
general  treatment.4  It  should  be  noted  that  many  qualitative  features,  which  are  char- 
acteristic of  the  scattering  of  particles  by  various  potentials,  are  very  well  illustrated 
in  the  scattering  by  a  potential  barrier  or  a  potential  well. 


4The  theory  of  scattering  is  given  a  more  extended  treatment  in  N.  Mott  and  G.  Massey, 
The  Theory  of  Atomic  Collis  ions,  New  York.  Oxford,  1949,  L.  Schiff,  Quantum  Mechanics, 
New  York-  McGraw-Hill,  1955. 


Chapter  29 

Second  Quantization 


A.    SECOND  QUANTIZATION  OF  THE  SCHRODINGER 
EQUATION 

As  an  example  of  second  quantization,  we  shall  consider  the 
nonrelativistic  Schrodinger  equation,  and  then  generalize  the  re- 
sults to  the  case  of  the  Maxwell  and  Dirac  relativistic  equations. 

In  Chapter  5,  it  was  shown  that  the  solution  of  the  time-dependent 
Schrodinger  equation  in  the  general  case  can  be  represented  in  the 
form 

-3" 

(29.1) 


where  the  coefficient  Cn(t)  ,  which  characterizes  the  probability 
that  a  particle  is  in  the  state  represented  by<J>rt,  includes  the  time- 
dependent  part  of  the  wave  function 


- 
Cn(t)  =  Cne     n    .  (29.2) 

Here  the  quantities  En  are  the  energy  eigenvalues,  and  the  eigen- 
functions  tyn  of  the  time-  independent  Schrodinger  equation  satisfy 
the  orthonormality  condition 


M3*==8n«'  (29.3) 

In  transferring  from  classical  quantities  to  quantum-mechanical 
ones,  the  position  .v,  the  momentum  pv,  and  the  remaining  quantities 
are  replaced  by  their  average  values 


(29.4) 
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The  time  variation  of  these  average  values  cannot  be  determined 
from  the  classical  Poisson  brackets,  which  in  this  case  coincide 
with  Hamilton's  canonical  equations 

_    dH 


dt    —  dPcl  ' 
and  must  be  calculated  from  the  commutator  relation 

i  =  ^(HF=n?lT).  (29.5) 

Here  the  Hamiltonian  is  equal  to 

"HSr+^w-  (29.6) 

The  quantum  equation  of  motion  [Eq.  (29.5)]  can  be  considered 
as  the  fundamental  equation  describing  the  quantization  of  the 
classical  equations  of  motion.  This  process  is  called  first  quanti- 
zation. In  order  to  connect  p  and  x  by  the  usual  relation 

i==«7-  <29'7> 

we  must  assume  that  in  Eq.  (29.5)  the  operators  p  and  x  do  not 
commute  with  each  other  and  obey  the  commutation  relation 

p.v-A'p=4'  <29'8) 

Thus  the  commutation  relation  (29.8),  which  is  basic  to  the 
Schrbdinger  theory,  can  be  regarded  as  a  consequence  of  the 
quantum  equation  of  motion  (29.5),  and  the  transition  from  the 
classical  equation  of  motion  to  the  Schrcdinger  equation  is  equiva- 
lent to  a  transition  from  corpuscular  concepts  to  the  wave  con- 
cepts. 

In  order  to  include  in  the  theory  the  corpuscular  properties 
of  the  de  Broglie  waves,  it  is  necessary  to  introduce  a  number 
of  additional  hypotheses  (for  example,  a  hypothesis  concerning 
the  probabilistic  nature  of  the  <[>  waves  and  the  meaning  of  the 
coefficients  ]  Cn  \  *  as  the  probability  of  an  electron  being  in  state 
n  ,  and  so  forth). 

The  process  of  second  quantization  enables  us  to  take  into 
account  both  the  corpuscular  and  wave  properties  of  the  particles. 

The  name  "second  quantization"  originated  from  the  fact 
that  in  this  case  we  quantize  the  equation  which  has  already  been 
quantized  as  a  result  of  the  first  quantization.  We  note  that  as  a 
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result  of  second  quantization,  the  coefficients  Cn  become  operators 
(q  numbers),  whereas  in  the  Schrbdinger  theory  they  remained 
ordinary  constants  (c  numbers). 

In  order  to  carry  out  the  second  quantization,  it  is  necessary 
to  find  the  average  value  of  the  energy  operator 

.  (29.9) 

Substituting  (29.1)  in  place  of  ty(t)  and  recalling  that  Htyn  =  £rt<|»n  f  we 
find 


Taking   into   account   the  orthonormality  condition  Eq.   (29.3),  we 
obtain 


t).  (29.10) 

We  note  that  the  last  expression  is  independent  of  t,  since  it 
is  apparent  from  the  equations 


that   the   time  factor  in  the  product  C*(t)-Cn(t)  is  simply  equal  to 
unity 

C*(t)Cn(t)  =  CnCn. 

Expression  (29.10)  is  not  an  operator  in  the  Schrbdinger  theory, 
since  the  coefficients  Cn  are  ordinary  numbers  (c  numbers).  In  the 
theory  of  second  quantization,  however,  these  quantities  and  the 
Hamiltonian  (29.10)  should  be  regarded  as  operators,  that  is,  as 
q  numbers. 

To  find  the  commutation  relation  for  the  coefficients  Cn  ,  we 
must  substitute  Cn(t)  and  C£(t)  in  place  of  x  and  pin  the  quantum 
equation  of  motion_  (29.5),  and  the  operator  H  must  be  replaced  by 
its  average  value  //.  We  get 

=      (»Cn(<)  -  Cn(t)H).  (29.12) 


According  to  Eq.  (29.11), 
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and  therefore  from  (29.12)  we  find  the  following  relation,  which  is 
a  fundamental  postulate  of  the  second  quantization: 

-  LvCV  =  IlCn,  -  Cfl.//.  (29. 13) 

Substituting  H  from  Eq,  (29.10),  we  obtain 

—  Bn'Cn'  =  ^  En  (C*nCnCn'  —  Cn'CnCn).  ^9. 14) 

n 

The  last  equation  has  two  solutions,  corresponding  to  the  Bose 
statistics  and  Fermi  statistics,  respectively.  The  first  solution  is1 

Cn'Cn  —  CnCn'=Qt  q    -     . 

c,c*      c*c    _*  (29.15) 

^n'^rt         Wi^rt'  —  Qnn'* 

This  can  be  easily  verified  by  substituting  the  expressions 

C>  =  C>C 

(29. ID) 

' 

into   the    first  and  second  terms,  respectively,  on  the  right-hand 
side  of  Eq.  (29.14).  We  then  obtain  the  identity 


From  Eq.   (29.15),  it  follows  that  the  coefficients  Cn  and  CJ  are 
operators.  Setting 

C;Cn  =  Af,  (29.17) 

where  N  is  the  number  of  particles  in  the  state  n  ,  we  find  from 
Eqs.  (29.15) 

CnCH=\+N.  (29.18) 

.«'' 

It  follows,  ^herefore,  that  these  operators  do  not  vanish  even  in  the 
case  where  there  are  no  particles  present.  Indeed,  even  though 
CiCn  =  o  when  N  =  Q9  we  still  have  in  that  case  CBC*  =  1.  This 
nonzero  value  for  the  combination  of  coefficients  C  gives  rise  to 
a  relationship  between  the  vacuum  (the  field  of  virtual  particles) 
and  the  real  particles. 


llt  follows  from  Eq    (29  15)  that  the  wave  functions  l/f(r,  t)  and  0(r',  £)  commute,  while 
and  i/j*  do  not  commute 

V>(r,  t)  0"(r'f  0-  <AV,  0  001.  0   =   S0n*(r')  ^rn(r)   -    S(/-  r) 
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It  is  clear  from  the  above  equations  that  the  operator  C£  should 
be  regarded  as  the  "creation"  operator  for  particles,  and  Cn 
as  the  "destruction"  operator.  If  at  the  initial  time  there  are  no 
particles  present,  the  condition  CnCn  =  \  signifies  that  a  particle 
can  first  appear  (owing  to  the  action  of  operator  CJ),  and  then 
disappear  (owing  to  the  action  of  Cn).  The  condition  C*nCn  =  0  im- 
plies that  in  the  absence  of  particles  the  inverse  process,  in  which 
a  particle  is  first  absorbed  and  then  emitted,  cannot  occur. 

In  the  solutions  (29.17)  and  (29.18),  the  quantity  N  must  be  a 
positive  integer.  It  is  not  restricted  to  any  maximum  value. 
Therefore,  these  commutation  relations  correspond  to  the  Bose 
statistics,  which  allows  any  number  of  particles  to  be  present  in 
one  state. 

The  second  solution  of  Eq.  (29.14)  can  be  represented  as 


which  is  easily  verified  by  the  direct  substitution  of  Eq.  (29.19) 
into  Eq.  (29.14).  Setting 

C*nCn  =  Nt  (29.20) 

we  find 

CnCJ=l—  JV.  (29.21) 

Noting  that  C*Cn  and  CnC*  cannot  be  negative,  we  find  that  the  number 
of  particles  in  the  state  n  can  assume  only  two  values:  Af  =  0  and 
N=l.  Consequently,  the  Pauli  exclusion  principle  is  already  in- 
cluded into  this  solution  and  the  particles  obey  the  Fermi  statistics. 
In  particular,  if  there  are  no  particles  present  initially  (/V  =  0),  then 
just  as  in  the  case  of  the  Bose  statistics  we  have 

C*Cn  =  0,    CnC;=L  (29.22) 

The  equation  obtained  in  second  quantization  describes,  there- 
fore, a  state  with  a  variable  (integral)  number  of  particles.  Con- 
sequently, electrons  will  be  similar  to  photons  not  only  because 
their  motion  is  described  by  a  wave  equation,  but  also  because 
electrons,  just  like  photons,  can  be  created  and  destroyed. 

Since  the  creation  and  annihilation  of  electrons  requires  an 
energy  more  than  twice  as  great  as  the  rest-mass  energy  of  an 
electron  (since  an  electron  is  always  created  together  with  a 
positron),  the  second  quantization  of  thenonrelativisticSchrodinger 
equation  is  of  purely  methodological  significance.  In  order  to 
consider  real  wave  processes  associated  with  the  creation  or 
annihilation  of  particles,  one  should  extend  the  method  of  second 
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uantization  to  relativistic  equations— namely,  the  wave  equation 
DP  photons  (Maxwell's  equations)  and  the  relativistic  wave  equation 
3r  electrons  and  positrons  (Dirac's  equation). 


B.    QUANTIZATION  OF  MAXWELL'S  EQUATIONS 

It  is  well  known  that  the  photon  field  (electromagnetic  field)  can 
e  described  by  a  vector  potential  which  satisfies  the  d'Alemberts 
quation 


a-  =  0,  (29.23) 

nd  is  subjected  to  the  following  condition: 

V  •  A  =  Q.  (29.24) 

Since  the  vector  potential  A  is  a  real  quantity,  we  can  represent 
he  solution  of  (29.23)  as 


-•'*')»  (29.25) 

t,   -  ^^  ' 

X 

yhere 

*  =  |*|,  and   **  =  j/h  (nt  =  0,  drl,  :±2,  =b3...). 

The  condition  (29.24)  means  that  the  vector  a  is  perpendicular 
o  vector  x  ,  that  is, 

x-a  =0.  (29.26) 

The  normalization  coefficient  — ^  I/  -~~—    is  determined  from 
he  condition  that  the  energy  of  the  electromagnetic  field 


(29-27) 


nust  be  equal  to  the  sum  of  the  products  obtained  by  multiplying  the 
mergies  chf.  of  the  individual  particles  by  the  corresponding  squared 
.mplitudes. 

Since  the  Hamiltonian  is  proportional  to  the  combinations  of 
coefficients  of  the  formC*C  +  CC*  (the  amplitudes  of  a  in  this  case 
)lay  the  role  of  the  coefficients  C),  the  quantum  equation  of  motion 
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(29.13)  for  the  field  of  photons  permits  only  a  solution  corresponding 
to  the  Bose  statistics.2  Furthermore,  taking  into  account  the  con- 
dition (29.24),  we  obtain  the  following  commutation  relations: 

(29.28) 

where  i.  i'  =  l,  2,  3  and  the  amplitudes  at  and a"V  refer  to  the  same 
vector  x  ,  which  plays  the  role  of  the  quantum  number  n  [see  Eqs. 
(29.15)].  The  coefficient  Aw  expresses  the  transverse  character  of 
the  photon  field 

v4,f,  =  xM,,  =  0.  (29.29) 

In  particular,  if  there  are  no  photons  present  at  the  initial 
instant,  only  the  following  bilinear  combination  of  amplitudes  will 
be  nonvanishing: 

a  tai  =  *u,-^.  (29.30) 


C.    SPONTANEOUS  EMISSION 

In  Chapter  9  we  found  the  Einstein  coefficients  A  for  the  spon- 
taneous emission  by  using  the  correspondence  principle.  In  such 
treatment,  the  reasons  for  transitions  of  electrons  from  higher  to 
lower  levels  remained  unexplained  (see  Chapter  9).  Quantum 
electrodynamics  (the  name  given  to  the  theory  that  includes  second 
quantization  of  the  electromagnetic  field)  explains  the  transitions  in 
terms  of  the  interaction  of  an  electron  with  the  field  of  vacuum 
(virtual)  photons. 

As  can  be  seen  from  Eq.  (29.30),  there  are  quadratic  com- 
binations of  the  quantized  amplitudes  of  the  electromagnetic  field 
which  differ  from  zero  even  in  the  absence  of  real  photons. 

The  time-  dependent  Schrodinger  equation,  taking  into  account  the 
doubly  quantized  field  of  photons,  can  be  represented  as 

-o.  <29-31> 


Neglecting  the  second-order  terms  proportional  to  A*,  and  taking 
into  account  the  transverse  character  of  the  electromagnetic  waves 

(29.32) 


2If  these  amplitudes  occur  in  the  Hamiltonian  in  the  combination  C*C  —  CC*  (this 
happens,  for  example,  for  Dirac  particles),  then  only  the  Fermi  statistics  would  hold  for 
the  corresponding  amplitudes  This  is  easily  verified  by  substituting  into  the  Hamiltonian 
the  different  solutions  for  the  Bose  statistics  [see  Eq.  (29.16)]  and  for  Fermi  statistics 
[see  Eq.  (29.19)]. 
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since 

V"4  =  0, 
Eq.  (29.31)  can  be  reduced  to 

=  0.  (29.33) 


Here     the     unperturbed     Hamiltonian     H0=l/4-y7-P*     is    time- 
independent,  and  the  operator  of  the  perturbation  energy  is 


Let  us  assume  that  the  electron  is  initially  in  the  state  n  .  Then 
under  the  influence  of  virtual  photons  3 

"«'.  p  (29-34) 


the  electron  can  jump  into  the  state  nr  . 

In   accordance   with   Eq.    (28*12),   we   obtain  the  following  ex- 
pression for  the  coefficient  Cn*(t),  characterizing  this  transition: 

«'.,.,.,  (29.35) 

A 


cofl/l, 
x 

where  the  matrix  element 


n'n  =  J  tS' 


(29.36) 


and  a>=rc*  is  the  frequency  of  the  emitted  photons. 

The  quantity  x  •  r  ~  j  is  small  since  the  wavelength  of  the  emitted 

light  is  \^  10  5  cm,  and  the  dimensions  of  the  atom  are  of  the  order 
of  r^  10  -"  cm.  Therefore,  in  a  first  approximation,  the  exponential 
factor  in  Eq.  (29.36) 

erl**f=\—  /x-r-f.  .. 

should  be  set  equal  to  unity.  Such  transitions  are  called  the  dipole 
transitions.4 


3Taking  into  account  the  commutation  relations  in  (29  30)  in  the  expression  for  the 
vector  potential  A  in  the  case  when  there  are  no  photons,  we  should  retain  only  amplitudes 
proportional  to  a*,  that  is,  operators  for  the  creation  of  particles 

4If  we  include  the  next  term  in  the  expansion  we  obtain  quadrupole  radiation.  Quadru- 
pole  radiation  is  (r/A)  times  weaker  than  dipole  radiation  and  is  of  importance  only  when 
dipole  transitions  are  forbidden. 
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The  probability   of  a  spontaneous  transition  from  the  energy 
level  n  to  the  level  n'  is  equal  to 


'  = 

.   ,,  v  (29.37) 

- 


Here  instead  of  two  sums  over  the  wave  numbers  of  amplitudes  a 
we  left  only  one,  since  the  only  combinations  of  amplitudes  a  and  a* 


different  from  zero  are  those  which  refer  to  the  same  momentum  x. 
Using  the  last  commutation  relations,  we  find 

(a-rf>.)(a*.  />„•„)  =  1  [xJ/v,]  |«. 

Let  us  replace  the  sum  over  v.  by  an  integral  in  Eq.  (29.37)  [see 
also  Eq.  (28.23)] 

<29'38> 


and  include  the  fact  that  for  sufficiently  large  values  of  time  t  ,  we 
have,  in  accordance  with  (28*20), 

=  *(«-*n,).  (29.39) 


Then  in  the  case  of  spontaneous  transitions  Eq.  (29.39)  reduces  to 
the  law  of  conservation  of  energy 

«=S«W=!.L,  (29.40) 


since  from  Eq.  (29.40)  it  follows  that  the  energy  of  the  emitted 
photon  hu  is  equal  to  the  energy  (En  —  En>  )  lost  by  the  atom  as  a 
result  of  transition.  For  the  probability  of  spontaneous  emission, 
we  obtain 


Evaluating  this  integral  and  remember  ing  that  Pn'n*= 
we  obtain  the  final  expression  for  the  probability  of  spontaneous 
emission 

,4    ,—  4  e  "*nn'  \r  .   |*  (29.41) 

—  3          *     '    nn  I  »  \AV.-±±J 
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which  was  already  derived  in  Chapter  9  with  the  aid  of  the  semi 
classical  correspondence  principle  [see  Eq.  (9.20)]. 


D.  BETA  DECAY 

As  another  example  of  the  application  of  second  quantization 
let  us  consider  the  theory  of  )3  decay.  This  phenomenon  consists 
of  the  emission  of  an  electron  (positron)  by  a  nucleus,  leading  to 
an  increase  (decrease)  of  the  nuclear  charge  by  unity. 

The  theory  of  ]8  decay,  which  resembles  in  some  respects  the 
emission  of  photons  by  atoms,  was  constructed  on  the  basis  of 
second  quantization. 

There  are  no  photons  in  an  atom.  A  photon  is  created  from  the 
vacuum  when  an  atom  makes  a  transition  from  one  energy  state  to 
another.  Similarly,  a  nucleus  does  not  contain  electrons;  they  are 
created  only  in  the  process  of  /3  decay. 

As  a  result  of  experimental  investigation  of  j3  decay  it  has  been 
established  that  these  electrons  have  a  continuous  spectrum  bounded 
by  a  certain  maximum  energy  equal  to  the  difference  in  the  energy 
of  the  nucleus  before  and  after  the  decay.  It  was  also  established 
that  in  )3  decay  the  angular  momentum  of  the  nucleus  changes  by 
a  multiple  of  h  ,  whereas  the  angular  momentum  carried  away  by 
the  electron  equals  (lj^hm 

The  apparent  violation  of  the  laws  of  conservation  of  energy  and 
angular  momentum  in  /}  decay  was  resolved  in  the  hypothesis  of 
Pauli,  who  assumed  that  the  emission  of  an  electron  is  accompanied 
by  the  emission  of  another  particle—a  neutrino  possessing  half- 
integral  spin  and  a  rest  mass  close  to  zero. 

According  to  the  Fermi  theory,  constructed  on  the  basis  of  this 
hypothesis,  j3  decay  should  be  considered  as  the  transformation  of 
one  of  the  nuclear  neutrons  (n)  into  a  proton  (p),  an  electron  (e~)9 
and  an  antineutrino  (v) 


Similarly,  the  emission  of  positrons  in  0  decay  should  be  con- 
sidered as  the  transformation  of  a  nuclear  proton5  into  a  neutron, 
a  positron  and  a  neutrino  6 

p—  >/i-|-e+-f-v- 


We  note  that  since  the  rest  mass  of  a  neutron  is  greater  than  the  total  rest  mass  of  the 
proton,  electron,  and  antineutrino,  it  follows  that  the  decay  of  a  free  neutron  should  also 
be  observed.  The  decay  of  a  free  proton  appears  impossible  from  the  energy  standpoint  and 
therefore,  positron  decay  can  be  observed  only  in  a  bound  proton,  when  the  required  energy 
can  be  taken  up  from  the  nucleus 

We  shall  explain  the  difference  between  the  neutrino  and  antineutrino  at  the  end  of  the 
present  chapter  in  the  discussion  of  the  nonconservation  of  parity 
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Furthermore,  the  capture  of  a  bound  electron  is  also  possible;  as 
a  result  of  electron  capture  a  proton  is  changed  into  a  neutron  and 
emits  a  neutrino  (p  +  e~  ~*  n  +  v)  .  As  a  rule,  an  electron  from  the  K 
shell  is  absorbed  in  this  process,  and  therefore  this  phenomenon  is 
called  K  capture.  K  capture  is  similar  in  nature  to  positron  j3  decay, 
since  in  both  cases  the  charge  of  the  nucleus  is  reduced  by  unity. 

We  shall  not  consider  here  the  details  of  j3  decay;  our  task 
will  be  to  describe  in  general  terms  the  creation  of  an  electron 
and  antineutrino  following  the  Fermi  theory. 

The  energy  of  interaction  of  a  neutron  with  the  electron- 
antineutrino  field  can  be  written  as 

V*  =ft?tf  ,  (29.42) 

where  f  is  a  coupling  constant  introduced  by  Fermi.  The  magnitude 
of  /  is  very  small  (/^  1.4  x  10~49erg  x  cm3  )  so  that  this  inter- 
action is  called  a  weak  interaction.  The  spontaneous  decay  of 
particles  is  caused  mainly  by  weak  interactions;  therefore,  the 
lifetime  of  elementary  particles  or  nuclei  is  comparatively  large 
and  varies  from  a  fraction  of  a  second  to  billions  of  years.  Nuclear 
processes,  on  the  other  hand,  are  caused  by  strong  interactions, 
which  sometimes  are  a  thousand  times  larger  than  the  electromag- 
netic interactions.  The  duration  of  the  processes  caused  by  such 
interactions  is  very  short  (of  the  order  of  10~23  sec). 

Neglecting  spin   effects,   the  wave  functions  i|*J   and  <|»~  can  be 

represented  as 
* 

l  *  _    -*/**icKt-ik  -  r 

(29.43) 

<>    =     -a          HI-X  '  r. 

K  there  are  no  particles  present  at  the  initial  instant,  the  following 
relations  hold  for  the  amplitudes  a*  and  b*:  a*a~b*b  =  Q,  aa*  = 


Then,  according  to  Eq.  (28.12),  we  have  the  following  expression 
for  the  coefficient  C(t): 

t 
C  (0  =  —  £Jr-3  Vpna*b*  J  dterWr-Kp-K-*)  .  (29.44) 

o 

Here  En  =  chKn  is  the  neutron  energy  and  Ep  =  cHKp  is  the  proton 
energy.  The  matrix  element  of  the  interaction  Vpn  is 

*.  (29.45) 

where  Xn  is  the  wave  function  of  the  neutron,  XP  is  the  wave  function 
of  the  proton,  and  7  is  the  Dirac  matrix  which  determines  the 
nature  of  the  interaction.  Since  the  interaction  energy  must  be 
a  scalar  quantity,  an  analogous  matrix  should  also  relate  the  spinor 
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amplitudes  of  the  wave  functions  of  the  electron  and  neutrino  in  Eq. 
(29.42).  However,  the  influence  of  the  specific  choice  of  the  Dirac 
matrices  manifests  itself  only  in  the  spin  effects,  which  are  ne- 
glected in  the  present  treatment. 

Just  as  in  the  investigation  of  dipole  radiation,  the  quantity  in 
the  exponent  in  the  matrix  element  (29.45)  is  much  less  than  unity, 
that  is,  inside  the  nucleus 


where  R  is  tue  nuclear  radius.  The  exponential,  therefore,  may  be 
expanded  in  a  series  ^ 

e-K*f  x)r=  i  _  i  (k  -fx).  r+... 

If  the  matrix  element  (29.45)  does  not  vanish  upon  replacement 
of  the  exponential  by  unity,  the  corresponding  transitions  are  said 
to  be  allowed  /?  transitions* 

We  note  that  the  allowed  transitions,  which  correspond  to  the 
dipole  transitions  in  the  theory  of  photons,  are  associated  with 
definite  selection  rules.7 


7In  the  case  of  vector  interaction,  the  matrix  element 


(the  fourth  component  of  the  four-dimensional  velocity,   the  first  three  components  for  the 
nucleon    at   rest   vanish)  will   differ  from   zero   (allowed    transitions)    if  the   spin   of  the 
nucleus  (that  is,  the  total  angular  momentum)  remains  unchanged  in  the  /?  decay  (AJ  -  0, 
Fermi  transitions)     This  Fermi  selection  rule  is  satisfied  for  the  majority  of  nuclei. 
There  are,  however,  some  cases  of  allowed  /3  decay,  for  example, 

2He6  ->  3Li6  +  e~  +  v 

in  which  the  spin  of  the  nucleus  changes  by  unity  (AJ  -  1)  The  spin  of  the  2He  nucleus 
is  equal  to  zero  (one  alpha  particle  plus  two  neutrons  with  antiparallel  spins),  while  the 
spin  of  the  3Li  nucleus  is  equal  to  unity  (one  alpha  particle  plus  one  proton  and  one 
neutron  with  parallel  spins,  just  as  in  deuterium). 

In  order  to  explain  these  selection  rules,  Gamow  and  Teller  pointed  out  that  when  we 
form  the  interaction  energy  operator,  which  is  a  scalar  (V-  VpnVey),  one  may  take  a 
product  of  two  vectors,  as  well  as  other  relativistically  invariant  combinations  of  the  four 
wave  functions  For  example,  we  may  take  a  product  of  two  pseudovectors,  which  is  also  a 
relativistic  invariant  The  pseudovcctor  interaction 


leads  to  the  selection  rules  A  J  -  0,    "tl     It  should  be  emphasized  that  for  a  nucleon  at 
rest,  the  first  three  components  will  be  different  from  zero 

In  view  of  the  fact  that  the  pseudovector  interaction  forbids  0  ->  0  transitions,  which 
were  nevertheless  observed  experimentally,  as  well  as  for  several  other  reasons,  present- 
day  theory  uses  a  combination  of  vector  and  pseudovector  interactions  (as  in  the  Feynman 
and  Gell-Mann  version  of  the  theory  of  fi  decay).  This  version  enables  us  to  explain  the 
basic  experimental  data  obtained  in  the  investigation  of  /?  decay. 


492 


FUNDAMENTALS  OF  NUCLEAR  PHYSICS 


In  the  subsequent  discussion,  we  shall  restrict  ourselves  to  the 
treatment  of  allowed  transitions;  in  this  case  it  is  sufficient  to  make 
the  exponential  term  in  the  matrix  element  (29.35)  equal  to  unity. 
We  then  obtain  the  following  expression  for  the  coefficient  C: 


'  —  JL 


(29.46) 


where  Ea==chKnp  =  ch(Kn  —  Kp)  is  the  energy  carried  away  by  the 
light   particles    (electron,    neutrino   or    antineutrino)    in  ]S  decay. 


Fig.  29.1  Energy  distribution  of  electrons  in 
p  decay  (according  to  the  Fermi  theory). 
KQ  is  the  maximum  energy  of  the  /i  spectrum 
The  origin  of  the  coordinate  system  corresponds 
to  the  energy  m0c 

Using  Eq.  (29.39)  and  changing  the  summation  over  the  momenta 
of  electron  and  neutrino  to  an  integration  [see  (29.38)],  we  obtain 
an  expression  for  the  probability  of  /3  decay 


=  i  I  c*c  =  TfifeT-  J 


k,  X 


Integrating  this  expression  over  all  possible  angles  at  which  the 
electron  and  antineutrino  emerge  from  the  nucleus  and  also  over 
the  energy  of  the  antineutrino,  we  obtain  an  equation  for  the  energy 
distribution  (E  =  cHK)  of  the  electrons  (the  /?  spectrum) 


w 


f 
=  J    w(E)dE9 


(29.48) 


where 


w  (E)  = 


(29.49) 
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The  energy-distribution  curve  of  the  j8  electrons,  obtained  in  the 
Fermi  theory,  is  plotted  in  Fig.  29.1. 8  From  this  curve  it  is  clear 
that  the  energy  of  the  emitted  electrons  lies  between  £mm=  m^ 
and  Emax=Eo  =  En  —  Ep 

According  to  the  Pauli- Fermi  theory,  there  is  no  violation  of 
the  law  of  conservation  of  energy,  since  the  total  energy  of  the 
emitted  antineutrino  and  electron  must  always  be  equal  to  the  total 
energy  lost  by  the  nucleus  during  ft  decay.9 


E.    NONCONSERVATION  OF  PARITY  IN  THE  DECAY 
OF  PARTICLES 

One  of  the  fundamental  discoveries  in  the  theory  of  weak  interactions  was  the  dis- 
covery of  the  nonconservation  of  parity  by  Lee  and  Yang  ( 1956),  This  phenomenon  gives 
rise  to  a  spatial  asymmetry  in  the  spontaneous  decay  of  elementary  particles  and,  in 
particular,  in  nuclear  0  decay. 

The  nonconservation  of  parity  can  be  observed  experimentally  in  the  following  two 
phenomena. 

1.  The  asymmetry  of  the  angular  distribution  of  electrons  in  the  8  decay  of  nuclei 
with  an  oriented  spin  (the  number  of  0  electrons  emerging  along  the  direction  of  nuclear 
spin  does  not  equal  the  number  of  electrons  emerging  in  the  opposite  directionX 

2.  The  existence  of  circular  polarization  (hehcity)  in  the  particles  formed  during 
decay  (for  example,  electrons  formed  in  0  decay  or  /j  mesons  formed  in  the  decay  of 
it  mesons),  even  in  the  case  when  the  decaying  system  has  zero  spin. 

The  phenomenon  of  nonconservation  of  parity  in  0  decay  or  in  the  decay  of  a  TT  meson 
was  explained  with  the  help  of  the  theory  that  assigned  a  definite  circular  polarization 
(helicity)  to  the  neutrino.  At  one  time  it  was  thought  that  the  neutrino  (which  is  formed 
in  positron  0  decay)  and  the  antineutrino  (which  is  formed  in  electron  0  decay)  were 
identical  particles  (Majorana's  hypothesis  X  It  was  suggested  that  this  hypothesis  could 
be  tested  in  double  0  decay. 

If  the  neutrino  and  antineutrino  were  identical,  one  would  expect  comparatively 
large  values  for  the  probability  of  double  0  decay  without  the  emission  of  a  neutrino  (one 
neutron  of  the  nucleus  emits  an  electron  and  a  neutrino,  while  another  neutron  emits 
an  electron  and  absorbs  this  neutrino),  that  is,  with  the  emission  of  only  two  electrons. 
If,  however,  double  0  decay  consisted  simply  of  two  successive  identical  0  decays  with 
the  emission  of  two  electrons  and  two  antineutrino s,  the  probability  of  decay  should  be 
much  less.  Experiment  has  confirmed  the  correctness  of  the  second  hypothesis  and 
clearly  demonstrated  that  the  neutrino  must  be  different  from  the  antineutrino. 

We  note,  incidentally,  that  both  particles  are  neutral  and  have  a  spin  1/2.  Physicists, 
however,  were  able  to  establish  a  difference  between  the  particles  from  phenomena 
associated  with  nonconservation  of  parity.  It  has  been  demonstrated  experimentally  that 
the  asymmetry  observed  during  positron  B  decay  of  a  nucleus  with  an  oriented  spin, 
when  a  neutrino  is  emitted  together  with  a  positron,  is  the  reverse  of  the  asymmetry 
that  is  observed  during  electron  0  decay,  in  which  an  antineutrino  is  emitted  together 
with  an  electron.  It  was  assumed,  therefore,  that  the  neutrino  differs  from  the  anti- 
neutrino  by  the  type  of  circular  polarization.  In  order  to  explain  the  experimental  data, 
it  was  necessary  to  postulate  that  a  neutrino  resembles  a  photon  with  left-hand  circular 
polarization,  while  an  antineutrino  resembles  a  photon  with  right-hand  polarization.  The 

It  should  be  noted  that  the  maximum  of  this  curve  is  slightly  shifted  towards  small 
energies.  This  asymmetry  is  due  to  the  fact  that  the  antineutrino  mass  is  equal  to  zero, 
while  the  electron  mass  is  different  from  zero  If  the  mass  of  the  antineutrino  were  equal 
to  the  electron  mass,  -this  curve  (neglecting  the  Coulomb  attraction  of  the  electron  and  the 
nucleus)  would  be  symmetrical,  that  is,  the  maximum  would  occur  at  the  point  E/2 

For  more  details,  see  H  Bethe  and  P  Morrison,  Elementary  Nuclear  Theory,  New  York: 
John  Wiley  &  Sons,  Inc  ,  1958 
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only  difference  is  that  the   spin  of  the  photon  is  1  (in  units  of  H),  whereas  the  spin  of 
the  neutrino  is  1/2. 


$(left) 


Neutrino 


Antineutrmo 


Fig.  29  2  Hehcity  of  the  neu- 
trino and  antmeutrmo  The 
neutrino  has  a  left-hand  circular 
polarization  and  the  antineutnno 
has  a  right-hand  circular  polari- 
zation 


Fig.  29.3   Hehcity  of  a  left- 
handed  neutrino  in  right-hand 
and       left-hand       coordinate 
systems 


The  circular  (longitudinal)  polarization  of  the  neutrino  is  generally  called  the  helicity. 

The  neutrino  has  left-hand  circular  polarization,  or  negative  helicity.  This  means 
that  if  a  left-handed  screw  rotates  along  the  direction  of  polarization,  it  moves  in  the 
direction  of  the  momentum.  The  antineutrino,  however,  has  right-hand  circular  polari- 
zation, or  positive  helicity  (see  Fig.  29.2).  In  order  to  conserve  helicity  when  changing 
from  one  Lorentz  frame  of  reference  to  another,  so  that  it  can  be  adopted  as  a  character- 
istic of  the  neutrino,  it  is  necessary  that  the  rest  mass  of  the  neutrino  be  exactly  equal 
to  zero. 10 

Several  authors  describe  polarization  with  the  aid  of  an  axial  vector,  which  is  per- 
pendicular to  the  plane  of  rotation  and  has  a  different  direction  in  the  right-hand  and 
left-hand  coordinate  systems  (see  Fig.  29.3).  It  should  be  noted,  however,  that,  although 
the  axial  vector  s  and  the  polar  vector  p  have  different  mutual  orientations  in  the  right- 
hand  and  left-hand  coordinate  systems,  the  helicity  is  nevertheless  conserved;  that  is, 
the  helicity  of  the  neutrino  is  still  negative,  and  only  the  method  of  description  has 
changed. 1 1 

Starting  with  the  polarization  properties  of  the  neutrino,  we  shall  make  an  attempt 
to  give  a  qualitative  explanation  of  the  nonconservation  of  parity  during  the  spontaneous 
decay  of  particles. 

Let  us  consider,  for  example,  the  0  decay  of  nuclei  with  oriented  spin.  We  note  that 
the  spin  of  the  nucleus  (its  longitudinal  polarization)  is  more  naturally  described  by  a 
rotation,  since  the  direction  of  the  axial  spin  vector  is  arbitrary.  The  spatial  asym- 
metry which  should  be  observed  in  the  phenomena  characterized  by  the  nonconservation 
of  parity  is  associated  with  the  fact  that  in  electron  0  decay,  right-handed  antmeutrmos 
are  emitted  upwards  and  downwards  with  spins  oriented,  respectively,  parallel  and 
antiparallel  to  the  spin  of  the  nucleus.  The  electrons  will  be  formed  predominantly  with 
a  helicity  opposite  to  that  of  an  antineutrino  (that  is,  with  a  negative  helicity).  This  pro- 
duces a  spatial  asymmetry  due  to  which  the  number  of  electrons  emitted  in  the  direction 
of  nuclear  spin  does  not  equal  the  number  of  electrons  emitted  in  the  opposite  direction 
(see  Fig.  29.4).  In  general,  we  obtain  the  following  equation  for  the  number  of  emitted 
electrons  as  a  function  of  the  angle  0  between  the  direction  of  electron  momentum  and  the 
upward  direction,  which  two  directions  form  a  right-handed  system  with  the  polarization 
of  a  nucleus:  ^J8)  =  ^([  _  ^  ^  ^  (29>50) 

where  a  is  positive  and  is  equal  to  approximately  0,4. 

If  the  rest  mass  of  a  particle  differs  from  zero,  the  parallel  spin  and  momentum  vectors 
may  be  directed  at  an  angle  after  transition  from  one  Lorentz  frame  of  reference  to  another 

USmce  the  direction  of  the  (axial)  spin  vector  *  relative  to  the  polar  vector  of  the  mo- 
mentum p  is  different  in  the  right-hand  and  left-hand  coordinate  systems,  Lee,  Yang,  Lan- 
dau [Zhurnal  Ekspenmental'noy  i  Teoretichebkoy  Fiziki,  32,  405  (1957)],  and  others  assume, 
on  the  contrary,  that  in  this  case  the  negative  helicity  of  a  neutrino  changes  to  a  positive 
helicity.  In  other  words,  they  assume  that  the  method  of  geometric  description  of  the  parti- 
cle can  change  its  internal  properties  (helicity). 
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Asymmetry  was  detected  in  the  electron  3  decay  of  Co  60  nuclei  with  oriented  spins 
and  also  in  the  angular  distribution  of  electrons  in  the  3  decay  of  free,  polarized  neutrons 
(for  which  a^  0.1). 

Since  a  neutrino  with  a  negative  helicity  emerges  during  positron  decay,  the  asym- 
metry pattern  will  be  opposite  to  the  one  described  above.  The  number  of  emitted  posi- 
trons is  related  to  the  angle  &  by  the  equation 

we+(§)  =  w0^  (1  +  a  cos  &);  (29.51) 


that  is,  the  positrons  that  are  formed  have  mainly  a  positive  helicity  and  are  emitted 
primarily  upwards.  An  asymmetry  opposite  to  that  of  electron  decay  was  observed  ex- 
perimentally in  positron  3  decay  of  Co58  nuclei  with  oriented  spins. 


ck-   4i 


Electron 
3  decay 


Positron 
3  decay 


Fig.  29.4  Schematic  diagram  of  the  ft  decay 
of  nuclei  with  oriented  spins  The  direction 
of  rotation  characterizes  the  direction  of  the 
spin  of  the  particles,  p  is  the  particle 
momentum 

Longitudinal  polarization  was  observed  particularly  clearly  in  the  spontaneous  decay 
of  pions  into  muons  and  a  neutrino.  Let  us  choose  a  coordinate  system  in  which  the  pion 
is  at  rest  and  consider  the  negative  pion  which  decays  into  a  negative  muon  and  an  anti- 
neutrino.  Since  the  antineutrino  has  a  positive  helicity  and  the  momenta  of  the  muon  and 
the  antineutrino  must  be  equal  and  opposite,  we  find  that  the  muon  also  must  have  a 
positive  helicity.  Indeed,  only  in  this  case  will  the  total  spin  of  the  muon-antineutrino  sys- 
tem be  equal  to  the  initial  spin,  that  is,  zero.  In  the  decay  of  positive  pions  into  a  neutrino 
and  positive  muons,  the  muons,  however,  will  obviously  have  a  negative  helicity  (see 
Fig.  29.5X 
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Fig.  29.5.  Decay  of  a  pion  at  rest  into  a  muon  and  a  neutrino 
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The  total  longitudinal  polarization  of  the  created  muons  has  also  been  confirmed  experi- 
mentally. 

A  more  detailed  discussion  of  the  nonconservation  of  parity  lies  outside  the  scope 
of  this  book,  and  we  must  refer  the  reader  to  the  special  literature  on  this  subject.  12 


See  the  papers  in  T    D    Lee  and  C.   N.  Yang,   Nobel  Lectures,  Physics,  New  York: 
Elsevier  Publishing  Company,  1964 


Appendix  A 

Hilbert  Space  and  Transformation  Theory 


There  is  an  elaborate  formal  structure  of  quantum  mechanics 
that  is  important  for  several  reasons:  the  wave  mechanics  and 
the  matrix  mechanics  can  be  unified  in  one  coherent  scheme.  The 
conceptual  structure  has  a  deep  intuitive  appeal  that  gives  the 
theory  a  sense  of  completeness  and  solidityc  The  physical  content 
can  be  embedded  in  an  extensive  and  rigorous  mathematical 
framework.  And,  most  important  for  the  validity  of  the  physical 
theory,  it  is  a  powerful  and  flexible  phenomenological  tool  that 
embraces  a  wide  variety  of  empirical  knowledge.  The  conceptual 
structure  and  statistical  foundations  of  quantum  mechanics  are 
formulated  most  fully  in  the  framework  of  "abstract  Hilbert 
space."  It  is,  however,  not  necessary  to  go  into  the  mathematical 
technicalities  to  achieve  an  accurate  physical  grasp  of  the  theory. 
For  a  complete  treatment  of  the  mathematical  and  statistical 
foundations  one  should  consult  the  original  literature.1  In  this 
appendix  we  shall  outline  some  definitions  and  notational  apparatus 
around  three  broad  topics:  vector  spaces,  operators  and  the  inner 
product. 

Vector  Spaces 

An  abstract  (complex)  vector  space  is  a  set  of  abstract  ele- 
ments called  vectors  (or  points  in  the  space)  which  together  with 
complex  numbers  obey  the  following  axioms: 

(1)  If  Va  and  Vb  are  vectors,  Va  *  Vh  =  Vb  *  <Pa  is  also  a  vector. 

(2)  Va  f-  Wb  +  Wc>  -  Wa  f  Vfc)  +  *c . 

(3)  If  x  and  y  are  complex  numbers  and  V  is  a  vector,  then  xV 
is  a  vector  and  x(y*V)  =  (xy)*P. 

(4)  (x  +  y)<P  -  x*V  +  yW  and  x(¥fl  +  HV  =  x»fl  +  *Wb* 

(5)  There  is  a  null  vector  Vnuii  -  0  such  that  ¥  f  Vnun  -  ¥  and 
O^P  ^  Vnull  for  all*. 


^.A.M.  Dirac,  The  Principles  of  Quantum  Me  chart  icv,  fourth  edition,  Clarendon  Press, 
Oxford,  1958.  W.  Heisenberg,  The  Physical  Principles  of  Quantum  Theory,  translated  by 
C.  Eckart  and  C.  Hoyt,  University  of  Chicago  Press,  Chicago,  1930.  (Dover  Publications, 
New  York,  1949).  J.  von  Neumann,  Mathematical  Foundations  of  Quantum  Mechanics, 
translated  by  R.  T.  Beyer,  Princeton  University  Press,  Princeton,  1955. 
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The  structure  of  a  vector  space  is  appropriate  for  introducing 
the  notion  of  linear  independence:  the  set  of  vectors  Vi,  ¥2*  •  •  •  * 
Vk  are  linearly  independent  if  the  only  set  of  complex  numbers  zi, 
22.  •  •  •  .  *k  satisfying  the  relation  21^1  +  22^2  +  •  •  •  +  zk^k  =  ^nuii 
is  the  trivial  set  z\  =  22  -  -  -  •  -  zk  -  0.  If  it  is  possible  to  specify 
n  linearly  independent  vectors  but  notn  +  1,  the  space  is  said  to  be 
^-dimensional.  If  there  is  no  limit  to  the  number  of  linearly  inde- 
pendent vectors  that  may  be  specified,  then  the  space  is  "infinite 
dimensional."  In  an  n-dimensional  space  a  set  of  n  linearly  inde- 
pendent vectors  ^i,  *P2,  .  .  .  ,  ¥„  forms  a  basis  that  spans  the  space 
in  the  sense  that  any  element  ya  of  the  space  can  be  written  as  a 
linear  combination  of  the  basis  vectors  V,,  i  -  1,  2,  .  .  .  ,  w, 


The  coefficients  at,  i  1,  2,  .  .  .  ,  n  are  complex  numbers  that  char- 
acterize the  vector  Va  in  the  basis  ^  ¥2»  •  •  •  »  ^n.  If  a/  and  &it 
/  =  1,  2,  .  .  .  ,  n  are  the  components  of  the  vectors  *Pa  -  2atV,  and 
Vb  --  16,^,  then  cz  =-  a,  +  b,  are  the  components  of  the  vector  Vc  - 
Sc,V,  =  ^a  f  V^  and  zal  are  the  components  of  the  vector  2Wa, 
where  z  is  a  complex  number. 

The  geometric  concept  which  is  the  fe-dimensional  generaliza- 
tion of  the  notion  of  "lines  and  planes  passing  through  the  origin," 
can  be  developed  with  the  following  definition: 

A  subset  9W  of  a  vector  space  is  called  a  linear  manifold  if  it 
contains  all  the  linear  combinations  y\^\  v  ^2^2  +  •  •  •  +  yk^k  along 
with  any  k(k  -  1,2,.  .  .  )  of  its  elements  Vi,  W2>  .  .  .  ,  W*.  Alterna- 
tively, if  8  is  an  arbitrary  set  of  vectors  containing  the  distinct 
vectors  Wi,  ^2,  .  .  .  ,  ^fe,  then  the  set  of  all  linear  combinations 
*iVi  +  X2^2  +  •••*-  ^cfe^fe  (with  arbitrary  complex  numbers  x\,  X2, 
.  .  .  ,  xfe)  is  a  linear  manifold  9W.  SK  is  called  the  '*  linear  manifold 
spanned  by  SI."  If  the  vectors  Vi,  V2>  -  •  •  »  ^  are  linearly  inde- 
pendent, the  manifold  is  fc-dimensional. 

The  utility  of  the  concept  of  a  linear  manifold  is  that  it  is  the 
domain  of  definition  of  a  linear  operator.  In  general  an  operator 
is  a  mapping  or  correspondence  from  a  domain  consisting  of  cer- 
tain points  in  the  vector  space  into  a  range  consisting  of  certain 
other  points,  denoted  symbolically  by  V  +  R(¥).  In  quantum  mech- 
anics we  are  concerned  with  linear  operators  for  which  by  defini- 
tion 

R(V  +  *)  =  B(V)  +  R(*)      and     B(aV)  =  atf  (¥)  . 

That  is,  the  domain  of  definition  of  the  operator  R  is  a  linear 
manifold.  Note  that  a  linear  operator  need  not  be  defined  for  all 
vectors  in  the  space.  For  example,  if  the  space  is  the  "function 
space"  consisting  of  square  integrable  functions  on  the  interval 
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_«,  <  x  <  ~,    then   there  are  vectors  (functions)  if/  for  which  xtf/  or 
d*///dx  are  not  square  integrable. 

There  is  a  "projection  property"  associated  with  a  linear 
manifold:  any  vector  V  may  be  resolved  into  the  sum  of  two  vec- 
tors ¥  =  ¥||  +  ¥jf  where  V||  is  in  the  manifold  2»  (i.e.,  there  are 
complex  numbers  cj,  C2,  .  .  .  ,  c&  such  that  V||  -  c  i*Pi  4-  02^2  f  •••-«- 
c-fc¥fcf  where  Vit  ¥2,  .  .  .  ,  V*  span  the  SK)  and^j  is  entirely  outside 
2H.  As  a  notational  device  we  may  define  the  linear  operator  P^ 
as  follows:  ¥||  -  P™W  for  all  ¥.  The  domain  of  definition  of  P^  is 
the  entire  space,  while  the  range  consists  of  2ft  and  the  null  vector. 
If  *!  lies  outside  2K,  Pm^i  =  Vnuii.  If  V2  lies  wholy  within  9»,  PWV2  = 
¥2.  If  ^  is  an  arbitrary  vector,  P^W  lies  entirely  within  2tt  and 
pm^  -=  pg»(pa»W)  -  ps»v-  Since  this  relation  holds  for  all  W,  we  may 
assert  the  operator  equation 


An  (Hermitian)  operator  satisfying  this  relation  is  called  idempotent 
or  a  projection  operator.  It  has  eigenvalues  0  and  1. 

Operators 

Let  81  stand  for  a  physical  observable  which  upon  sharp  meas- 
urement yields  any  one  of  a  sequence  of  values  8{,  82,  •  •  •  which 
are  characteristic  of  the  observable.  The  set  of  values  Hi,  82,  •  .  • 
is  called  the  spectrum  of  8.  An  algebra  of  observables  is  set  up 
as  follows:  let  ^  be  a  complex  number,  then  z$l  is  an  observable 
with  characteristic  values  ztti,  z%2,  •  -  •  and  if  a  measurement  of  8 
yields  8jfe,  the  measurement  of  z  8  yields  z8iU  The  observable  B2  = 
88  yields  the  measured  value  (8fe)2  when  8  yields  Si.  If  8  and  8 
are  simultaneously  measurable  then  the  sum  8  t  8  is  defined  to  be 
the  observable  that  yields  8^  f  S3?  when  8  yields  8^  and  SB  yields 
«'/.  Finally,  the  product  8®  may  be  defined  by  the  artifice 

8»  --   -J  (8  +  »)2  -  ^(8  -  S)2 
4  4 

If  8  and  SB  are  simultaneously  measurable.  These  definitions  pro- 
vide a  "physical"  construction  of  polynomials  of  several  simul- 
taneously measurable  observables  8,  83,  etc. 

In  quantum  mechanics  the  mathematical  idealization  of  an  ob- 
servable is  a  linear  operator  on  a  Hilbert  space.  Let  us  first  out- 
line the  purely  algebraic  aspects  of  operators  which  can  be  defined 
without  reference  to  the  space  on  which  they  act.  An  abstract  op- 
erator algebra  is  a  collection  of  elements  called  operators  which 
together  with  complex  numbers  are  endowed  with  the  structure  of 
a  vector  space  (in  a  technical  sense,  not  to  be  confused  with  the 
Hilbert  space)  and  in  addition  an  operator  product.  Explicitly, 
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(1)  If  M  and  /V  are  operators,  then  M  +  N  =  N  +  Mis  also. 

(2)  M  +  (IV  f  0)  -  <M  +  IV)  +  0. 

N   (3)    Multiplication  by  a  complex  number  is  allowed;  i.e.,  xM  is 
an  operator  and  x(yM)  -  (xy)M. 

(4)  (x  +  y)M  -  M  ±  yM  and  x(M  ±  IV)  -  xM  ±  x/V. 

(5)  There    is  a  null  operator  0  such  that  0  +  M  -  M  and  OM  -  0 
for  all  M. 

Furthermore,  there  is  a  composition  law  (the  "operator  prod- 
uct") which  assigns  to  each  ordered  pair  of  operators  (M,/V)  another 
operator  which  is  denoted  by  M/V.  The  composition  law  obeys  the 
following  axioms  (we  now  denote  operators  by  TI,  T2,  T3): 

(a)  T\T%  is  a  operator  if  TI  andT2are;  i.e.,  the  operator  algebra 
is  closed  with  respect  to  the  product. 

(b)  Ti(T2T3)  -  <TiT2)T3   (associative  rule). 

(c)  Ti(aT2  +  bT3)  =  aTiT2  +  bTiTa     and     (aT2 


(d)  There  is  a  unit  operator  Tunit  -  1  such  that  IT  -  TI  -  T  for 
all  operators  T. 

Once  a  concrete  identification  of  the  operators  is  made  and  a 
labeling  scheme  adapted  a  "multiplication  table"  can  be  setup. 
The  multiplication  table  can  be  formulated  with  the  help  of  the 
notion  of  linearly  independent  operators.  Assuming  there  are  only 
a  finite  number  n  of  linearly  independent  operators  TI,  T2  .....  Tfl, 
we  may  express  any  operator  as  a  linear  combination  of  these.  In 
particular,  as  the  operator  algebra  is  closed, 


where  the  complex  coefficients   Ctk  are  the  structure  constants  of 
the  algebra. 

The  algebraic  properties  defined  by  operator  axioms  (l)-(5)and 
(a)-(d)  can  be  realized  concretely  in  terms  of  linear  transforma- 
tions on  a  vector  space.  The  operator  TI  is  identified  with  the 
transformation  which  maps  V  into  TiW.  The  transformation  is 
linear 


Sum  and  difference  TI  ±  T2  is  identified  with  (T!  ±  T2)*P  -  TI*P  ± 
while  zTi  corresponds  to  the  transformation  (zTtfV  -  z(TiV),  these 
relations  being  valid  for  all  ¥  for  which  TI¥  and  T2*P  are  defined. 
If  the  range  of  T2  includes  the  domain  of  TI,  the  operator  product 
TiT2  is  defined  by  (TiT2)V  -  Ti<T2¥)  for  all  V  in  the  domain  of  T2. 
A  transformation  is  nonsingular  if  it  transforms  distinct  vec- 
tors into  distinct  vectors;  that  is,  if  ^  /  ¥2,  then  T^i  -/  T^2.  An 
alternative  definition:  T  is  nonsingular  if  to  each  vector  $x  -  T^! 
there  is  a  unique  solution  VL  In  particular,  if  TVi  -  Vnuiu  then 
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The  unique  solution  ^i  may  be  expressed  in  terms  of 
4>i  by  an  operator  S  defined  by  the  relation  ^i  =  S$i.  It  then  fol- 
lows that  ^i  -  ST^i.  If  this  latter  equation  holds  for  all  *Plf  we  may 
assert  the  operator  equation  ST  -  1;  S  is  called  the  left  inverse 
of  T.  If  TS  -  1,  then  S  is  the  right  inverse  of  T.  If  ST  -  1  -  TS, 
then  S  is  the  inverse  of  T  and  is  given  the  notation  T"1  also,  T  -  S'1. 
If  A  and  B  have  inverses,  then  the  inverse  of  AB  is  (AB)'1  -  B~l  A~l 
even  if  A  and  B  do  not  commute.  Finally,  if  the  vectors  Vi,  ^2* 
.  .  .  ,  Wjfe  are  linearly  independent  and  span  a  manifold  2)1 ,  and  if  V 
is  a  nonsingular  transformation,  thenVVi,  V^  •  •  •  »  Wk  are  linearly 
independent  and  span  the  transformed  manifold  V2W. 

Inner  Product 

An  Hermitian  inner  product  is  a  mapping  that  assigns  to  each 
ordered  pair  of  vectors  W,  $  a  complex  number  designated  by  (¥,  4>). 
The  number  (V,<t>)  is  called  the  inner  product  of  W  and<J>.  An  inner 
product  has  the  following  properties: 

(V,  $)  -  <<&,  ¥)*     (Hermitian  symmetry) 
(¥,a<D  +  60)  =  a(V,«)  +   6(¥,Q) 
(V,V)  1  0  and(y,¥)  =  0  implies  V 


The  norm  or  "length"  of  a  vector  V  is  ||  W||  -  V<¥,¥)  and  the  dis- 
tance between  V  and  O  is  ||  V  -  0||.  If  (W,<I>)  -  0  the  vectors  are 
orthogonal.  A  set  of  vectors  ^1,^2,  .  .  .  forms  an  orthonormal 
basis  if  (¥„¥&)  -  5^,  i,fe  -  1,2,  .  .  .  .  Some  important  theorems  are 
the  Schwartz  inequality  |(V,<W|  <||V||  •  ||$||  and  the  triangle  in- 
equality ||W||  +  ||$||  _i  ||W  +  $||.  For  each  linear  manifold  2R there  is 
an  orthonormal  basis  which  spans  9M. 

The  projection  operator  P^  of  a  manifold  3M  spanned  by  the 
vectors  Wi,*P2>  •  •  •  >  ^k  can  D^  written  in  terms  of  the  inner  prod- 
uct as 

W  -   E  ^(VnW 

i=  i 

The  projection  operator  onto  the  ray  or  one  dimensional  manifold 
defined  by  y,  is  PZW  -  ¥, <¥„¥).  If  V!,  V2,  ...  is  an  orthonormal  set 
the  projection  operators  are  related  to  the  vectors  by  the  equations 
Pt^k  -  5,jk^.  The  orthonormality  property  is  P,Pfe  -  d^Pk  and  the 

completeness  relation  is  SPt  =  1,  the  unit  operator. 

i 

Let  ^1,^2*  ...  be  the  eigenfunctions  of  an  operator  M  corre- 
sponding to  the  eigenvalues  Mi,M2,  ....  For  simplicity,  let  us 
assume  the  eigenvalue saredistinct,  Mj  ^  Mj^for/  ^  k\  then  (*,,^)  -  0 
and  the  vectors  may  be  normalized  so  they  form  an  orthonormal 
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basis.    The  relation  of  the  projection  operators  P,  of  this  ortho- 
normal  basis  to  the  operator  M  is  the  following:   MP,  -  P,M  -  MjP/. 

The  spectral  resolution  of  M  is  M  -  EM,'P;.  Let  F(A)be  an  arbi- 

;=  1 

trary  function  of  a  real  or  complex  variable  A.  The  operator  func- 
tion F(M)  is  defined  by 

00 

F(M)  =   E  F(M;>P7 

i=i 

In  the  mathematical  realization  of  an  observable  SI  in  terms  of 
an  operator  A,  the  eigenvalues  Ai,A2  of  the  operator  A  are  identi- 
fied with  the  characteristic  values  8li,  82  of  the  observable.  The 
construction  of  polynomials  of  observables  is  identified  with  the 
construction  of  functions  of  an  operator  given  just  above. 

There  is  a  particularly  important  notation,  the  Dirac  notation, 
in  which  a  ket  \  >  is  placed  around  the  symbol  *P  to  denote  a  vector. 
|  V  >  is  called  a  ket  vector.  The  inner  product  between  vectors  Vfl 
and  ¥&  is  denoted  by  <  Wa  |  W6  >.  The  expression  <  Va  |  is  regarded 
as  a  vector  in  its  own  right;  it  is  called  a  bra  vector  or  simply  a 
bra.  The  primary  distinction  between  bra  and  ket  vectors  is  that 
if  the  abstract  vector  1P  transforms  as  ^  ->  UV  under  a  unitary 
transformation  U,  the  bra  |¥>-»[7|¥>=  !(/¥>,  while  the  ket  <V\ 
transforms  (contragrediently)  as  <V\  -»  <UV\  =  <V\U+  =  <¥|U"1. 
Under  these  transformation  properties  the  inner  product  <  W|<I»  -= 
<  UV  |  I/O  •>  remains  invariant.  Matrix  elements  of  an  operator  M 
are  written  as  <V|M|$>  =  (T,MO).  In  the  Dirac  notation,  the  de- 
fining equation  for  a  projection  operator  reads 


for  an  arbitrary  ket  |  W  >.  Since  |  ¥  >  is  arbitrary  it  is  a  permis- 
sible notational  device  to  omit  writing  the  |  ¥>.  Then  the  expres- 
sion for  a  projection  operator  in  the  new  notation  is 

n 

P<Dt  -    E  IV.xVJ 

where  the  sum  runs  over  the  vectors  ^i,^  .  .  .  ,  Vn  which  span 
the  manifold  9W.  An  arbitrary  operator  A  may  be  expressed  in 
terms  of  its  matrix  elements  A^  -  (0,,  AO&)  in  a  complete  ortho- 
normal  basis  $i,$2>  ...  by  the  notation 


There  is  a  notational  simplification  possible  when  working  with 
a  particular  basis  <Di,<&2,  .  ,  .  in  that  the  ubiquious  <D  is  a  redundancy 
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fc 

in  the  notation:  the  ket  vector  |  $t  >  can  be  abbreviated  to  read  1  i  >. 
The  Hilbert  space  for  a  specific  physical  problem  is  built  on  the 
cononical  coordinates  QVQ2*  •  •  -  Qk  f°r  a  system  of  fe-degrees  of 
freedom.  The  abstract  vectors  have  no  numerical  significance, 
rather  the  q'a  serve  to  label  what  will  be  considered  a  complete 
orthonormal  set,  yqvq2,  .  .  .  ,  qk.  The  ket  will  be  denoted  by  \ql9 
q2,  .  .  .  ,  qk  >.  In  the  (improper)  5-  function  normalization  scheme 
the  orthonormality  relation  is 


<qv  q2y  -  •  -  »  qk    ^i'^'  •  •  -  »  qk  >  =       i  - 
An  arbitrary  vector  ¥a  may  be  expanded  in  the  Vq  basis: 


=  J  .  •  •  /  *(ta(qvq2,  .  .  .  , 


where  the  expansion  coefficient  ^a(qvq2^  .  .  .  ,  qfe)  is  the  probability 
amplitude  for  finding  particle  1  at  position  ql$  etc.;  i.e.,  ^a(qvq2j 
.  .  .  ,  qk)  is  the  Schrodinger  wave  function  in  the  coordinate  repre- 
sentation. Taking  the  inner  product  of  |  To  >  with  the  bra  <qv  .  .  .qk\ 
and  using  the  orthonormality  relation,  one  may  verify  that 


In  quantum  mechanics  a  particle  may  have  a  discrete  internal 
degree  of  freedom,  spin,  for  example,  where  the  discrete  variable 
o  takes  on  either  of  two  values,  t  for  spin  up  or  i  for  spin  down. 
For  a  particle  with  spin  and  one  spatial  degree  of  freedom  x9  an 
arbitrary  state  %  may  be  expanded  as 


where  Oa(x)  is  the  probability  amplitude  for  finding  the  particle  at 
x  with  spin  a  (see  Chapter  16). 

Treating  now  a  particle  on  the  interval  -«>  <  *  <  «  with  no  spin, 
we  write  the  inner  product  of  two  vectors  Va  and  ¥&  by  first  ex- 
panding Wfr  in  terms  of  the  set  V, 


then  taking  the  inner  product  using  the  distributive  rule  under  the 
integral.  The  result  is 


(»a 


>»oo 

<k  =    / 

*r—<X) 
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the  ordinary  tunction  space  inner  product.  The  projection  operator 
Pab  onto  the  interval  a  <  x  <  b  is 


abV  -    f   ¥*<¥„ 
•/• 


,W)dx, 
or  in  the  Dirac  notation 


/"V, 


An  arbitrary  operator  M  is  expressed  in  terms  of  its  x-  space 
matrix  elements  <x\M\y  >  -  (Vx,MVy)  by  the  formula 

/OO         >»CO 
I      <x|M|y>  \VxxVy  \dxdy 
.0     */-00 

the  action  of  an  operator  M  on  a  (ket)  state  vector  |  *P  >  is  written  as 

/CO         v*CO 
/      |VJC:xx|M|yxy|M'>  dxdy  , 
JO     •/  -00 

where  it  is  now  appropriate  to  call  <-x|_M|y>  the  kernel  of  the 
integral  operator  M.  The  average  value  Mf  or  what  is  the  same 
thing,  the  expectation  value  of  M  in  the  state  <t>,  is 

M  --=  ( 


An  operator  V  is  called  local  if  its  matrix  elements  have  the 
special  form  <x|M|y  >  -  3(x  -  y)V(x),  where  3(x  -  y)  is  the  Dirac 
S-function  and  V(x)  is  an  ordinary  function.  The  expectation  value 
of  a  local  operator  is 


r°° 
V  -     / 

•/-co 


The  kinetic  energy  operator  T  has  the  matrix  element  <  x  \  T  \  y  >  = 

/»2 

-  ~  ^x8(x  -  y),   which  is  also  regarded  as  a  local  operator.  With 
tin 

the  help  of  Integra  tion-by- parts  the  expectation  value  of  T  can  be 
written 


T  —  —  ~— 
2m 


/*° 
I 

•/ -no 


where 


Appendix  B 

The  Statistical  Assertions  of  Quantum 
Mechanics 


We  consider  a  cononical  system  of  k  degrees  of  freedom,  em- 
ploying the  coordinates  g}  .  .  .  qk  to  specify  its  configuration  and 
the  cononically  conjugate  momenta  to  specify  its  condition  of 
motion.  In  the  wave  mechanical  mode  of  description,  in  the  co- 
ordinate representation,  everything  that  can  be  said  about  the 
"state  of  the  system  at  one  time"  (its  configuration,  condition  of 
motion  and  the  values  of  all  its  physical  quantities)  must  be  de- 
rived from  the  wavefunction  <f>(q^  .  .  .  qk).  The  functions  admitted 
as  wave  functions  are  those  that  are  square  integrable  (normaliz- 
able,  ||  0||  finite)  and  furthermore  normalized  to  unity, 

y»00  y»00 

/       ...  /       \<t.(ql,---tqk)\Zdql.^dqh   =    1, 

*^— 00  *^  — CD 

although  in  applications  to  continuous  spectra  this  requirement 
may  be  relaxed.  There  are  three  primary  statistical  assertions: 
(1)  the  probability  of  finding  the  system  within  a  volume  V  of  con- 
figuration space  is 


(2)  if  the  energy  of  the  system  has  the  operator  H  with  eigenvalues 
Ei,E2»  •  •  -  and  eigenfunctions  ^^^  •  •  •»  tlien  *e  probability  that 
the  system  has  the  energy  value  E7  in  the  state  0  is 


I       -co  -co 

/    '  '  '  /     ^M\  -  •  • 

I  ./-co  •/-co 


and  (3)  the  average  value  of  a  physical  observable  81  to  which  the 
operator  A  corresponds  is 


/  /      0<gt,  •  •  •  ,  qk)*A   fo     .  .  .  ,  ^    ; 

»/_oo          */-oo  \ 
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A 

in  the  quantum  state  <£.  The  interrelation  of  these  three  assertions 
will  now  be  examined. 

In  order  to  present  assertion  (1)  in  a  general  form,  we  intro- 
duce the  projection  operators  Py(/y),  ;  =  !,•••,*,  associated  with 
the  intervalue  l}  defined  by  q]  <  QJ  <  qj.  The  projection  operator  is 
defined  by 


0 


The  projection  operator  associated  with  the  (rectangular)  volume 
V  is  Pi(/i)P2(/2>  •  •  •  P*C*)  and  the  integral  specifying  the  prob- 
ability of  finding  the  particle  described  by  0  in  V  may  be  reduced 
to  the  expression 


For  assertions  (2),  let  Psn  denote  the  projection  operator  associ- 
ated with  the  eigenvalue  En"of  H,  that  is, 

PEn  1/fm    ~    ^n    f°r    m  =  HJ    ^d    =    0    for    m  j4  tl  . 

The  probability  of  finding  the  value  En  upon  a  sharp  measurement 
of  the  energy  of  a  system  in  the  state  $  can  then  be  put  in  the  form 


The   probability   of  finding   the  system  within  an  energy  interval 
E'<  E  <:E"is 


where  P(/E)  is  the  projection  operator  associated  with  the  interval 

P«E>  -       Z      Pzn  - 


Assertions  (1)  and  (2)  may  now  be  unified  in  what  is  called  "the 
most  general  probability  assertion  possible": 

Statistical  Postulate:  The  probability  that  in  the  state  0 
the  physical  quantities  with  the  operators  Ai,A2,  •  •  -  ,Amtake 
on  values  from  the^  respective  intervals  /j,/2»  •••»'«  is 


where  Pi,P2,  .  .  .  Pm  are  the  projection  operators  belonging 
to  the  operators  AI  .  .  .  Am. 
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Assertion  (1)  has  m  =  fe,  with  A\  -  q\,  A%  -  q^  •  -  •  »  A^  =  <?&;  while 
assertion  (2)  has  m  =  1,  with  AI  -  H.  In  order  to  insure  that  this 
postulate  is  a  coherent  statistical  statement  the  following  prop- 
erties must  be  varified: 

(a)  Since  the  order  of  the  operators  is  arbitrary  in  formulating 
the   statistical  question,  the  order  of  the  projection  operators  in 
the   statistical   assertion  must  be  immaterial.    This  implies  that 
the   projection  operators   must   commute  for  arbitrary  intervals 
/b/2»  •  -  -  Jm ,    and   this   in   turn  implies  that  the  operators  Aj,A2» 
.  .  .  ,Am   must  commute  among  themselves. 

(b)  Vacuous   propositions  may  be  inserted  at  will,  vis.  if  the 
interval   /'  is  contained  within  the  interval  /,  P(/)P(/')  -  P(/'),  or 
the   projection   operator   for   obtaining  any  value  whatever  is  the 
identity. 

(c)  If  the  interval  13  lies  outside  the  spectrum  of  A/,  for  some  ;', 
the  corresponding  projection  operator  is  the  null  operator  and  the 
probability  is  zero. 

(d)  Probabilities  are  additive;  i.e.,  if  an  interval  /  is  resolved 
into  two  disjoint  subintervals  /  -  /'  +  /",  with  operators  P</')  and 
P(/")  corresponding  to  /'  and  l"t  respectively,  then 

||P(/)<D||2  -   ||  PC/')  (D  ||2  +   ||P(n<D||2. 

(e)  The    total   probability  W  ranges  over  the  values   0  <  W  <  1 
for  normalized  <1>. 

Assertion  (3)  at  the  beginning  of  this  appendix,  can  be  expressed 
in  terms  of  the  inner  product.  If  we  introduce  the  notationExp{9,<I>i 
to  stand  for  the  statement  "the  expectation  value  of  the  physical 
observable  SI  in  the  state  <V'  assertion  (3)  may  be  stated 

Exp!3(,<J>j  =,  A  •=  (<D,A<D), 

where  A  is  the  operator  corresponding  to  81.  Let  F(A)be  any  func- 
tion of  the  real  parameter  A.  There  is  a  theorem  to  the  effect  that 
if  the  observable  9  has  the  operator  A,  then  the  observable  F(S) 
has  the  operator  F(A).  Assertion  (3)  may  then  be  generalized  to 
read 


ExpiF($l),4>!   =   F(A)  --   (O,F(A)0)  (3') 


Assertion  (3')  can  be  derived  from  the  statistical  postulate  as 
follows:  subdivide  the  interval  (-<«,  ~)  into  a  sequence  of  subinter- 
vals /„  =  \\n,  Xn  +i  I  .where  -«»  <  A-n  <  •  •  •  <  i  A-i  <  AO  <  AI  <  •  •  •  >  A«  < 
+  00.  Let  \n  be  some  number  in  the  interval  A;  <  A;  <  A;+i.  The 
average  value  F(A)  in  this  mesh  is 


FXA)  -  F  (  A;) 

/=  -n 
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If  we  introduce  the  monotonic  increasing  projection  operation  E(A) 
defined  by 


A;<A 

(conversely,  P(/7)  -  E(A;+i)  -  E(A,))  and  let  the  n  ->  <*>  and  the  mesh 
size  vanish,  the  sum  approaches  the  Stieltjes  integral 


which  by  definition  of  a  function  of  an  operator  is  (<J>,F(A)<J>).  So 
far  we  have  indicated  that  assertions  (3)  and  (3')  follow  from  the 
Statistical  Postulate.  What  is  remarkable  is  that  the  converse  is 
also  true:  the  Statistical  Postulate  follows  from  assertion  (3).  The 
proof  of  this  statement  is  obtained  by  a  technical  application  of  the 
following  theorem:  Let  Ai,A2,  .  .  .  ,ATO  be  a  set  of  mutually  com- 
muting operators;  there  exists  an  operator  R  and  functions  Fj(  A), 
F2(A)  .....  Fm(A)  such  that  AI  --  Fi<fl),A2  -  Fo(tf)  .....  Fm  -  Fm(R). 
In  the  preceding  discussion  we  have  assumed  that  the  state  Is 
a  pure  state,  that  is,  it  is  described  by  single  vector  <J>*  In  general, 
in  the  physical  preparation  of  a  state  some  of  the  variables  are 
left  uncontrolled  and  the  state  Is  not  completely  specified.  This 
situation  can  be  formulated  in  terms  of  a  classical  probability 
distribution  that  is  superimposed  on  top  of  the  quantum  mechanical 
uncertainty.  Let  us  suppose  that  the  system  is  in  one  of  the  states 
V0,a  =  1,2,  .  .  .  ,  but  we  don't  know  precisely  which  one.  Let  Wa  be 
the  classical  probability  that  the  system  is  in  the  state  Wa.  The 
basic  statistical  postulate  can  now  be  reformulated  as 

^    -     E  W  PlUl)P2</2>  '   >   •  Pm^m)  Vail2 

a 

and  assertion  (3)  can  be  generalized  to  read 

Expl  K^o.VJ  -  !>«<1'a,AVa) 

a 

The   states   Va   are   an   arbitrary   set  of   states;  they  need  not  be 
mutually   orthogonal.    For  the  Wa's  it  is  required  that  Wa>  0  and 

S\Va-    1. 

The  statistical  assertions  can  also  be  formulated  in  terms  of 
the  trace.  The  trace  of  an  operator  A  is 


trA  -         (^.AV,)  , 
i 

where  the  sum  runs  over  a  complete  orthonormal  basis  Vi,V2,  .... 
The  trace  is  independent  of  which  basis  is  used  to  define  it.  The 
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statistical  assertions  are 

W    -    trpPi</i)P2</2>'  '  '  Pm</m> 

and 

Expl  a^Vj  =  trpA 
where  p  is  the  density  matrix  (operator) 

n     -     V*   IV     P  V  I  *P      ^W<WI 

P    -     /  *  "a  *  a.    -     /  ^  !  Ya  '    "a  ^  Ta.[ 

a  a. 

The  density  matrix  is  useful  for  making  a  general  statement  about 
the  time  evolution  of  the  system.  Expectation  values  change  with 
time  according  to  the  general  rule 

-    ExplH;tfa,¥a|  -  tripA  +  PA  \  . 
dt 

In  the  Schrodinger  picture  the  burden  of  change  is  put  on  /?;  namely, 
A  -  0  and  +  ih  p  ^  Up  -  pll,  where  H  is  the  Hamiltonian.  In  the 
Heisenberg  picture  p  =  0  and  ~i~K  A  -  HA  -  AH,  (see  Problem  8.4). 
The  Hamiltonian  controls  the  evolution  of  an  isolated  system.  When 
the  system  is  subjected  to  a  measurement  of  one  of  its  observ- 
ables  ft  and  the  eigenvalue  R'  obtained  the  uncontrollable  disturb- 
ance of  the  measuring  process  forces  the  system  discontinuously 
into  the  state  described  by  the  eigenvector  WW'  corresponding  to 
the  eigenvalue  R\  If  the  measurement  of  R  is  repeated  (before  the 
evolution  generated  by  the  Hamiltonian  moves  the  system  into 
another  state)  the  system  is  already  in  an  eigenstate^'of  #  and 
thus  yields  the  value  R'  with  certainty. 


Problems 


Chapter  1 


1.1  Express   the   space   and  time   dependence   of  E,  H  and  A  in 
terms    of    amplitudes   and  phase   angles   for   a   plane  wave 
moving  in  the  positive  x  direction.  What  are  the  conditions 
for  plane  or  circularly  polarized  light?  What  is  the  initial 
data  at  t  =  0  for  solution  of  Maxwell's  equations  or  the  wave 
equation  for   A?    Express  the  solution  in  terms  of  a  power 
series  in  time.    Express  the  solution  for  t  <  0  in  terms  of  a 
solution  for  t  >  0  with  appropriate  initial  data. 

1.2  Determine  the  magnitudes  of  E  and  H  for  unpolarized  sun- 
light characterized  by  the  solar  constant  (Poynting  vector) 
Sx  =  2  cal^cm^mln"1.   What  is  the  ratio  of  electric  to  mag- 
netic force  on  an  electron  moving  with  the  speed  of  an  elec- 
tron in  the  first  Bohr  orbit? 

1.3  Compare  the  wave  and  photon  descriptions  of  normal  reflection 
of  sunlight  from  a  mirror.  What  is  the  energy  density,  pres- 
sure  against  the   mirror  and  the  density  of  photons?  Take 
X   =  55 00 A,    What  is  the  electric  field  intensity  of  a  photon 
absorbed  in  a  1  cc  detector? 

1.4  Find  an  expression  for  the  angular  momentum  density  of  an 
electromagnetic  field.   Show  that  a  photon  carries  an  angular 
momentum  of  magnitude  H.  Calculate  the  torque  on  a  quarter 
wave  plate  by   a  normally   incident  left- circularly   (right- 
screw)  polarized  light. 

1.5  Determine   the   spectral  density  of  radiation  under  the  as- 
sumption that  the  oscillator  representing  the  behavior  of  the 
walls  for  angular  frequency  o>  can  assume  the  energy  value 
E  =  0  or  any  one  of  a  continuum  of  values  E  >  »o.  Show  that 
the  gap  in  the  energy  spectrum  is  related  to  the  behavior  of 
P^  in  the  quantum  region  kT  «  7to>. 

Answer.   The  partition  function  Z  -  2e"aEn  =  1  +  e'^/m*. 


Chapter  2 

2.1     Consider   the   scattering   of  4   MeV  alpha  particles  on  gold 
atoms  (Z  =  79).  Show  that  the  distance  of  closest  approach  is 

rmm  =    (ZZ'e$/2E^  (l  f  csc|0)  and  evaluate  it  for  scattering 
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angles  5°,  20°  and  80°.  At  what  angles  will  the  scattering 
deviate  from  the  Rutherford  formula? 

2.2  Consider  the  scattering  of  alpha  particles  from  gold  foil.  A 
radium  source  yields  4.8  MeV  a-particles  at  a  rate  of  3.7  x 
1010  particles   per   second  per  gram  of  Ra.    The  target  has 
5.9    x    1022  atoms-cm"3,    a   thickness   of   4    xlO~5cmfanda 
cross  sectional  area  of  8  mm2.    The  target  is  situated  1  cm 
from  the  source.  Consider  a  detecting  screen  of  2  mm2  area 
5  cm  from  the  target.  What  amount  of  radium  is  required  for 
a  counting  rate  of  30  per  minute  at  scattering  angles  10°  and 
120°? 

2.3  Derive    (2.46)    for    a   general   one- dimensional   potential  by 
evaluating   dl/dE  directly.    Reduce   the   validity  of  the  Bohr 
quantization   rule   for  large  n  by  equating  the  classical  ex- 
pression for  the  frequency  of  the  expression  in  terms  of  the 
energy   level  spacing.    (This  application  of  the  correspond- 
ence principle  shows  that  (2.46)  and  (2.47)  give  a  consistent 
classical  limit  if  the  same  constant  ft  appears  in  both  equa- 
tions.) 

2.4  Examine  the  classical  and  quantum  descriptions  of  the  emis- 
sion of  light  from  a  harmonic  oscillator.  On  the  basis  of  the 
correspondence  principle,  argue  that  the  n  appearing  in  the 
Bohr  quantization  rule  is  the  same  Planck's  constant  char- 
acterizing the  corpuscular  nature  of  light.  Deduce  the  energy 
level    spacing   and   infer   the   levels  between  which  electric 
dipole  transitions  are  possible  (selection  rules). 

2.5  Use  the  correspondence  principle  to  infer  the  energy  levels 
of  a  rigid  rotator  consisting  of  a  mass  m  held  at  distance  b 
from  a  fixed  axis  of  rotation.  Note  that  the  angular  momentum 
is  independent  of  the  mechanical  parameters. 

Show  that  for  circular  motion  in  a  general  centrally  sym- 
metric field,  the  angular  momentum  is  quantized  in  steps 
of  H. 

2.7  Find  the  energy  levels  of  circular  orbits  in  the  Bohr  plane- 
tary model  of  the  hydrogen  atom  using  the  correspondence 
principle  directly.   Note  that  as  usual  n  is  left  undetermined 
up  to  an  additive  constant  of  integration.  Infer  the  selection 
rules  for  electric  dipole  transitions  between  circular  orbits 
and  find  the  accompanying  angular  momentum  change. 

2.8  Estimate  the  lifetime  of  a  stationary  Bohr  orbit  by  computing 
the  time  required  for  the  corresponding  classical  motion  to 
radiate  away  an  amount  of  energy  equal  to  the  quantum  energy 
level  spacing.  Compute  the  cumulative  time  for  several  suc- 
cessive transitions  between  circular  orbits  and  compare  it 
with   the   time    taken   to  radiate  the  same  amount  of  energy 
classically. 

2.9  Use  the  Bohr  quantization  conditions  to  determine  the  energy 
levels  of  an  isotropic  three-dimensional  harmonic  oscillator. 


8<6 
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Let  the  potential  energy  be  i  ma>2r2  and  use  polar  coordinates 

in  the  plane  of  the  classical  motion. 

2.10  Find  the  energy  levels  of  the  relativistic  linear  harmonic 
oscillator  according  to  the  Bohr-Somerfeld  quantization  rule. 
Evaluate  the  lowest  order  correction  to  the  non- relativistic 
energies.  The  potential  may  be  introduced  via  the  scalar 

potential    e$  -  ~kx2    or  by  replacing  the  mass  invariant  by 

M0c2  +  l  kx2. 

£* 

Chapter  3 

3.1  Calculate   the  deBroglie  wave  lengths  of  the  following  par- 
ticles each  with  a  kinetic  energy  500  keV:  photon,  electron, 
proton,  and  alpha  particles.  Also  of  thermal  oxygen  atoms  at 
300°K. 

3.2  What  is  the  dispersion  law  a>  -  a>(k)  if  the  group  velocity  is 
inversely   proportional   to   the   phase  velocity?   How  is  this 
case  realized  physically? 

3.3  Compare    the   reflection   of  a   particle   and   a   wave  from  a 
moving   surface.    Show  that   AE/Ao  -  Ap/A/e,  where  AE(Ap)  is 
the   change    in   the   energy    (momentum)    of  the  particle  and 
Ao>(Afe)  is  the  change  in  frequency  (wave  number)  of  the  wave. 
Show  this  independently  of  the  dispersion  law  of  the  wave  and 
the  energy- momentum  relation  of  the  particle.  How  can  this 
result  be  generalized  and  what  is  its  significance? 

3.4  Calculate  the  deflection  of  a  charged  particle  by  a  thin  slab  of 
magnetic  field,  expressing  the  change  of  momentum  in  terms 
of   the   vector  potential.    How  is  the  difflection  explained  in 
terms  of  the  wave  picture?  What  is  the  relation  between  mo- 
mentum and  wave  vector  in  the  presence  of  a  magnetic  field? 

Answer.  £k  =  0  across  the  slab  and  p  +  Ae/c  --  Tfk. 

3.5  Consider  a  particle  bouncing  back  and  forth  in  a  rigid  box  of 
linear   dimension  L.    What  is  the  minimum  measurable  ki- 
netic energy  of  the  particle?   How  much  energy  is  required 
to   constrain   an  electron  to  remain  within  a  volume  of  nu- 
clear size? 

3.6  Determine   the    maximum   time  a  free  particle  will  remain 
within  a  volume  of  radius  R  by  considering  the  limitations  on 
the  specification  of  the  initial  data  in  the  classical  descrip- 
tion of  motion. 

3.7  Find   an  uncertainty  relation  connecting  angular  momentum 
and  angular  orientation.   Consider  a  rigid  rotator.   What  is 
the  minimum  uncertainty  in  L? 

3.8  One  billiard  ball  bounces  on  another  with  a  center- to-center 
height  of  ten  times  the  radius.  What  is  the  optimal  horizontal 
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localization  to  maximize  the  number  of  bounces?  What  is  the 
maximum  number  of  bounces? 

Answer,  n  ~  50. 

3.9  What  is  the  optimal  localization  of  an  ideal  pendulum  to 
maximize  the  time  it  will  remain  balanced  in  an  inverted 
position  and  what  is  the  maximum  time? 

Answer.  About  six  times  the  period  for  small  oscillations. 

Chapter  4 

4.1  Under  what  circumstances  can  a  narrow  potential  be  approx- 
imated by   a   ^-function?   What  is  the  effect  of  a  5-function 
potential  on  the  continuity  property  of  a  solution  of  Schrod- 
inger's  equation? 

4.2  Show   that   the   potential   V(x)  =  ~gd(x2  -  a2)   has   a    discrete 
anitsymmetric   eigenfunction  for  g  >  H2/m  and  not  one   for 
8  <  Hz/m. 

4.3  Consider  the  potential  V(x}  =  «,  for  \x\  >  L  and  V(x)  --=  -V0S(jc) 
for  |  x  |  <  L.  Show  that  for  VQ  >  0  there  is  one  eigenvalue  E  <  0 
and  that  it  remains  discrete  as  L  approaches  infinity. 

4.4  Find   a   complete  set  of  orthonormal  eigenfunctions  for  the 
potential  V  -  -|  VQ  |5(x)  on  the  interval  -<*>  <  x  <  <*>.  Check  the 
completeness  relation  (4.67). 

4.5  (Sturm-  Liouville  theorem)    Let  \l/\  and  t//2  be  solutions  with 
energies    E\   and    E2  of  two  Schrbdinger  equations  with  po- 
tentials V\  and  V2»  respectively.  Show  that 


f 


0*  [<E2  - 


for  any  interval  a  <  x  <  6.  Show  that  if  y>i  and  ift2&re  solu- 
tions for  the  same  potential  and  if  E2  >  EI,  then  i//2  has  at 
least  one  node  between  each  pair  of  consecutive  nodes  of  ^t. 
Over  the  entire  interval,  then,  ^2  has  at  least  one  more  node 
than  \l/lm  (The  eigenfunctions  are  ordered  in  energy  according 
to  the  number  of  nodes.  A  set  of  eigenfunctions  is  complete 
if  there  is  one  for  each  integral  number  of  nodes.) 
4.6  (Comparison  potential  for  discrete  states)  Consider  the  in- 
terval 0  <  x  <  M  with  the  boundary  condition  (7(0)  =  0.  Find 
the  potential  for  which  U  =  %n  is  a  zero  energy  solution  of  the 
Schrodinger  equation.  Show  that  the  potential  obtained  has  no 

discrete  eigenfunctions  for  n  real.   If  n  is  complex,  n  =  I  + 

2 

i'/3,  show  that  the  potential  has  arbitrarily  many  bound  (i.e., 
discrete)   states.   The  solution  may  be  taken  to  be  real  U  = 

Vx  cos  3  In*. 
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4.7  Let  VQ  be  the  comparison  potential  obtained  in  problem  4.6. 
Show  the  following: 

(a)  If   V  >  VQ   for  x  >  XQ,  there  is  no  discrete  eigenfunctions 
of  V  with  a  node  located  beyond  XQ. 

(b)  If    V  <  VQ   for    x  >  x0,   there   is  a  discrete  eigenfunction 
with    arbitrarily   many   nodes  beyond  XQ.    There  are  in- 
finitely many  discrete  states  of  V. 

(c)  If  V  >  VQ  for  0  <  x  <  XQ,  there  is  no  solution  for  the  po- 
tential V  with  E  <  0  that  has  a  node  in  the  interval  0  <  x  <  XQ. 

(d)  If  V  <  VQ  for  x  <  x0  (i.e.,  if  V  -»  -<*»  more  rapidly  than  V0), 
then  there  are  eigenfunctions  with  arbitrarily  many  nodes 
in  the  interval. 

4.8  Scale  the  function  in  problem  4.6  to  find  some  eigenfunctions 
of  the  potential  V  -  ~  for  x  <  a  and  x  >  6,  V  ^  -V0(l/x2  - 
l/b2)a2b*/(b2  -  a2)  for    0  <  a  <  x  <  6.    Is    there   a   restriction 
among  the  parameters   VQ,  a,  and  b  for  this  method  to  work? 
Can  one  find  a  complete  set  this  way? 


Chapter  5 

5.1     Let  a  particle  be  prepared  in  the  state  ^r(x,0)  =  V2a0(xo  -  x) 
explipx  f  a(x  -  XQ)!  at    t  =  0,   where    a  >  0,  XQ  <  0  and  0(x)  = 

i(l  +  signx).   Assume  that  no  forces  act  on  the  particle  for 

t  >  0.    Expand   «/>(x,0)  in  terms  of  energy- momentum  eigen- 
functions and  show  that  for  t  >  0, 

,i    *\        V8a&    expG'px  -  i8p2  +  /y2/4/3)    C™  e~L  dL 

u/\Xyt)    —    —    • 


(2a/3  +  ly)  J0       1  _  izL2 

where  p  =  ^/2m,  y  =  x  -  x0  -  2j9p,   and  z  -  (1  +  iy/2apf2.   In- 
terpret i/r  for  ^  -»  oo. 

5.2  Reconsider  the  initial  wave  function  specified  in  problem  5.1. 
Assume   the   particle   is   in  a  potential  field    V  =  -|Vo|S(x). 
Expand  ^(x,0)  in  terms  of  the  eigenfunctions  of  problem  4.4 
and  follow  the  motion  in  the  dispersion-free  approximation 
(3.20).  Compute  the  probability  of  the  particle  being  reflected 
by  the  potential.   What  is  the  probability  of  finding  the  par- 
ticle in  the  ground  state? 

5.3  The  initial  value  problem  of  the  time- dependent  Schrodinger 
equation  can  be  solved  in  the  form  of  an  integral  operator 

with  the  Green's  function  kernel  Gt(x,x')  =  £)  ^n^  tl*n(*')*  e~lt*ni 

n 

where    \l/n  is   a   complete  set  of  orthonormal  energy  eigen- 
functions. 

(a)  Find  the  differential  equation  and  boundary  conditions  for 
Gt  in  the  case  of  a  general  one-dimensional  potential. 
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(b)  Show  that  the  force-  free  motion  of  a  non-relativistic  par- 
ticle   in   an   s-  dimensional    Euclidian    space    is  given  by 

-'    -  s/2  '2 


Gt(iM-')   -  (m/2mWs     exp|«m(r  - 

(cf.  Problem  3.3  in  the  text.  How  can  Gt  serve  as  an  approx- 
imation to  the  S-function?) 

5.4  Find   the    angular   dependence   and  total  current  of  charged 
particles    flowing   from   a    source    at   the    origin.    The  wave 
function   far   from  the  source  is  cos6elkr/r  in  spherical  co- 
ordinates. 

5.5  A   particle   is    in  a   uniform  gravitational   field  in  a  region 
bounded  from  below  by  a  perfectly  reflecting  surface.  Adapt 
the  WKB  method  to  the  case  of  an  impenitrable  wall  and  show 
that  the  energy  levels  go  as(rc  \-  3/4)  2/3  for  large  n.  Show  that 
&„  -  £n_i  =  ft™  for  large  n,  where  w  is  the  classical  angular 
frequency. 

Chapter  6 

6.1  Find  the  probability  for  a  particle  of  charge  +//CQ  and  speed 
v  to  penetrate  through  the  Coulomb  barrier  to  a  nucleus  of 
charge  /^o.  (Gamow  factor) 

6.2  Derive  (6.35),  using  integration  by  parts  to  pick  out  terms  of 
order  T2  in  Ejv. 

6.3  Find  the  pressure  exerted  by  the  walls  to  contain  a  particle 
in   a   very  deep  potential  well  of  volume  V  at  absolute  zero 
temperatures. 

6.4  Find  a  relation  between  the  volume  and  pressure  of  an  elec- 
tron gas  at  0°K. 

6.5  Show   that   (6.66)    holds   for   an  arbitrary  periodic  potential 
(Block's  theorem). 


Chapter  7 

7.1  Express  1/r  as  an  integral  operator  in  the  momentum  repre- 
sentation. ______ 

7.2  Find  the  kernels  for   the   expression  of  p2  and  Vp2  f  m20c2  as 
integral  operators  in  the  coordinate  representation. 

7.3  Evaluate  the  commutator  pxV(r)  ~  V(r)px. 

7.4  Let  ft(r,p)  be  a  polynomial  in  the  components  of  r  and  p.  Show 
that 


- 

dpx         1i 

and  indicate  explicitly  the  meaning  of  the  partial  derivative. 
7.5      (a)  Find  an  expression  for  the  operator  (A  -  zB)~l  in  powers 
of  2,  where  A  and  B  are  non-commuting  operators. 
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(b)  Let  A  and  B  be  IV  x  IV  matrices  and  let  zi  be  the  root  of 
the  equation  det(A  -  28)  -  0  which  is  smallest  in  absolute 
value.  Prove  that  the  expansion  of  part  (a)  converges 
whenever  |  z  |  <  |  z\  \  and  diverges  whenever  |  z  |  >  |  z\  |. 

7.6  Show  that  in  general  any  quantum  mechanical  operator  can  be 
expressed  as  an  integral  operator. 

7.7  Find  an  expression  for  the  translation  operator  Ta,  defined 
by  Ta^(x)  -  i//(x  +  a). 

7.8  The  solution  of  the  initial  value  problem  of  the  time-  dependent 
Schrodinger  equation  can  be  expressed  in  terms  of  an  oper- 
ator U(t)  that  transforms  the  wave  function  at  time  t  -  0  into 
the  wave  function  at  time  t\  that  is,   i//(t)  -  U(t)  i/y(0). 

(a)  Upon   what   properties    of  the  Schrodinger  equation  does 
this  depend? 

(b)  Express  11  (t)  as  an  integral  operator  in  terms  of  a  com- 
plete set  of  energy  eigenfunctions. 

(c)  Evaluate  the  kernel  of  the  integral  operator  V(t)  for  the 
motion  of  a  free  particle. 

7.9  The  concept  of  the  quantum  mechanical  average  value  has  its 
natural  generalization  in  the  notion  of  the  "inner  product"  in 
the  representation  space  (see  Appendix).  The  average  value 
M  -  (MMVW    is    the   inner   product   of  V   with   the  vector  Mf. 
Demonstrate  the  following: 

(a)  Let  ¥„  /-  I...™,  be  a  complete  set  of  basis  vectors 
which  is  orthonormal  with  respect  to  the  inner  product, 
(¥i,  Wfr)  =.  fizfe.  Show  that  the  action  of  an  operator  M  is 
completely  determined  by  its  matrix  elements  M^  -  (V,, 


(b)  An   operator    U    is    Hermitian   if   (^//O)  -  (OT,<t>)    for  all 
vectors  V  and  <£.  Show  that  the  eigenvalues  of  /]  are  real, 
the  eigenvectors  corresponding  to  distinct  eigenvalues  are 
orthogonal,  and  there  exists  a  basis  in  which  H  is  diagonal. 
Find  the  conditions  on  the  matrix  elements  Htk  such  that 
11  is  Hermitian. 

(c)  If  (VJQV)  -  0   for  all  vectors  V  and  if  (0  is  Hermitian,  then 
(0,  OT)  =  o   for  all  ¥  and  $.  One  may  then  assert  the  oper- 
ator equation  (0  -  0. 

(d)  If  (%  A$)  =  (OT,<I>)  for  all  ^  and  4>,  then  B  is  said  to  be  the 
Hermitian  conjugate  of  A  and  is  given  the  special  notation 
B  -  A+.    Find  the  relation  between  the  matrix  elements  of 
A    and  A+.    Show  that  A   is  the  Hermitian  conjugate  of  A4 
and   that   a   Hermitian   matrix   is  self-conjugate,  H  =  H+. 
Express  the  Hermitian  conjugate  of  the  operator  product 
AC  in  terms  A*  and  C\ 

(e)  Show   that   if  the  inner  product  is  positive  definite,  i.e., 
(<J),<f>)  >  0  for  all  $  but  the  null  vector,  then  the  eigenvalues 
of  AA*  are  non-negative. 

(f)  An  operator  (/  is  unitary  if  <W,U<1>)  -=  (^,<J))  for  all  W  and  $. 


518  PROBLEMS 

If  %,  i  =  !...«,  is  an  orthonormal  basis,  then  ^J  =  U^\  is 
also.  Find  the  conditions  on  the  matrix  elements  Utk  such 
that  U  is  unitary.  Show  that  U  is  non- singular. 

(g)  Show  that  the  operators  defined  in  problems  7.7  and  7.8 

are  unitary. 
7.10   If  A  and  B  are  N  x  N  matrices  and  if  AB  =  I,  show  that  BA  =  /, 

/  being  the   N  x  N   unit  matrix.   Construct  a  counter  example 

to    show    this    result    is    not   true   in  general  for  "infinite 

matrices, "i.e.,/V  »  «>. 


Chapter  8 

8.1     Show  that  the  commutation  relations  between  the  operators 
for  the  velocity  components  of  a  charged  particle  in  a  mag- 

netic field  are  vxuy  -  vyvx  -  l—-  Hz  and  cyclic  permutations. 


8.2     Show  that  if  V,  and  ^^  are  energy  eigenfunctions  and  if  M  does 
not  depend  explicitly  on  time,  then  the  time  dependence  of  off- 

diagonal  matrix  elements  is  givenby 


at 
where  fau^  -  Ez  -  EJ9 

8.3  Let  U(t)  be  the  operator  that  generates  the  solution  WU)  of  the 
time-  dependent   Schrodinger   equation    (with   Hamiltonian  H) 
from  arbitrary  initial  data  V(0)  in  the  form   ¥(*)  -  U(t) 

See  problem  7.8. 

(a)  Show   that    U(t)   satisfies    the  operator  equation 
HU(t),  with  initial  data  U<0)  =  1. 

(b)  Deduce  from  the  differential  equation  that  U(t)  is  unitary. 

(c)  Specify  under  what  conditions  and  in  what  sense  U(t)  may 
be  written  as  U(t)  =  exp(-i////^). 

8.4  In   the  Schrodinger  picture  observables  are  represented  by 
operators  that  do  not  depend  explicitly  on  time,  the  change  of 
dynamic  variables  with  time  being  described  by  the  change  of 
state  vector  V(t).  In  the  Heisenberg  picture  the  time  depend- 
ence is  transferred  to  the  operators  themselves  by  means  of 
the  unitary  operator  in  problem  8.2.  The  procedure  is  M(t)  = 
(VU)fM¥0))  -  (W0,h(f)¥0)f  where  V0  =  ¥(0)  is  the  state  vector 
at  /  =  0  and  where   ft(«)  -  U(trlMU(t). 

(a)  Show    thatiTztfu  M  -  tfft,     where    M  =  U(trlHU(t).    If  H  is 
independent  of  time  #  =  H.    This  is  the  Heisenberg  equa- 
tion of  motion. 

(b)  If  AB  -  BA  =  /C,  then  (IB  -  I3Q  =  iC. 

(c)  Derive  an  expression  for  the  time  derivative  of  the  op- 
erator product  Q(f)@(t). 

(d)  Find  the  position  operator  in  the  Heinsenberg  picture  for 
the  motion  of  a  free  particle.   Express  the  result  in  the 
coordinate  representation. 
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(e)  Using  the  position  operator  in  part  (d),  find  the  time  de- 
pendence of  the  variance  of  the  position  of  a  freely  moving 
particle. 

8.5  Show  that  the  mean  value  of  the  kinetic  energy  in  a  state  be- 
longing to  the  discrete  spectrum  is  related  to  the  mean  value 
of  the  potential  energy  by  the  relation  2T  =  (r  •  VV).  If  y  is  a 
homogeneous  function  of  degree  a,  V(Ar)  =  \aV(r)f  then  2T  =  aV 
(virial  theorem). 

8.6  Verify  the  following  "sum  rules." 


(b)  Z]  6>lfc  |  r,*  |2  =   -—  ("F-sum  rulew 

*  2m0 


2    Tfe 
mo 


(e)  2-  <^^  I  Ttk  I     -  diverges 


Here  7*«ljfe  =  E,  -  E*.  Note  that  <yljfe  >  0  if  k  designates  the 
ground  state.  Sum  rules  are  important  because  they  provide 
a  means  of  testing  the  physical  content  of  the  theory  even 
though  the  mathematical  problem  cannot  be  solved  com- 
pletely. 

Chapter  9 

9.1  Formulate  the  set  of  coupled  differential  equations  governing 
the  population  of  a  large  number  of  quantum  states.  Show  that 
the  total  number  of  levels  occupied  does  not  change  with  time. 
Identify  the  conditions  under  which  the  occupation  numbers 
are  positive  definite.  Derive  the  relations  among  the  Einstein 
coefficients   which   specify   the   approach  to  thermodynamic 
equilibrium. 

9.2  Compute    the    current  density   of  the  non-stationary   state 
which  describes  the  transition  from  the  2p  to  the  Is  state  of 
hydrogen  and  calculate  the  radiation  field  from  this  source. 
Identify  the  angular  distribution  and  state  of  polarization  for 
each  value  of  the  magnetic  quantum  number  m.  See  Chapter  13. 

Chapter  10 

10.1  At    what  quantum  level  can  a  wave   packet   in  a  harmonic 
oscillator  be  localized  to  1%  of  the  total  excursion?  Use  the 
uncertainty  principle. 

10.2  Find  the  stationary  states  of  a  linear  harmonic  oscillator  in 
a  uniform  electric  field. 
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10.3   Suppose   that   at    t  =   0,    a  particle  in  the  potential  field  V 
-       22 


-  mo>2x2  is  described  by  the  wave  function  ^(x^))  =  const. 
2 

exp    iikQX  -  -a2(x  -  *o)2/»    where  a2  =  mo>/X    Calculate    the 

probability  amplitude  for  the  particle  being  in  each  of  the 
energy  eigenstates.  Find  the  wave  function  for  /  >  0.  Discuss 
the  spreading  of  the  wave  packet. 

00 

10.4  Show  that  e-s**2st        £  -"  Hn(0  is  the  generating  function  for 

n  =  o    n1 

Tchebycheff-  Her  mite  polynomials.  Use  it  to  establish  the 
appropriate  orthonormality  relations. 

x»00 

10.5  Show  that          lln(x)e-**e"'*d*  -  \/n  lmpme-P2/4. 


10.6  Infer  the  matrix  elements  of  the  position  operator  of  a  har- 
monic oscillator  from  the  spectrum,  selection  rules  and  the 
F-sum  rule. 

10.7  Solve    the  Heisenberg  equations  of  motion  to  find  the  time- 
dependent   Heisenberg   operators   for   the   position  and  mo- 
mentum of  a  harmonic  oscillator. 

10.8  Find  the  time-dependent  Green's  function  for  the  harmonic 
oscillator.    Normalize  by  comparing  it  to  the  free  particle 
Green's  function  at  a  time  ait  -'  1. 

10.9  Study  the  harmonic  oscillator  using  the  operator  properties 
of  the  variables 


a   - 


\fCh/2mcj)  d/dx  , 


V< 


x   - 


Prove  the  following  statements: 

(a)  The  Hamiltonian  operator  is  H  -  a*  a  +  |  where  the  energy 

is  measured  in  units  of  7?w. 

(b)  The  commutation  relations 

aa*  -  a+  a  -  1 
Ha  -  aH  ^  -a 
Ha+  -  afH  -  a+ 

hold  as  operator  equations. 

(c)  The    Hermitian   conjugate    (or    adjoint)    of   a    is    a+.   H  is 
Hermitian  (self-  adjoint). 

(d)  If   Ur  is  an  energy  eigenfunction  with  eigenvalue  c,  then 
aUe     is  an  eigenfunction  with  energy  e  -  1  and  (a*)nUe  is 
an  eigenfunction  with  eigenvalue  t  +  n.  This  gives  a  series 
of  equally  spaced  energy  levels. 
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(e)  There  is  a  finite  lowest  eigenvalue. 

(f)  The  series  must  terminate  at  the  lower  end  by  arriving  at 
a    "ground   state"  UQ  that  satisifes  the  equation  alJQ  ^  0 
(=  the  null  vector).  Solve  this  differential  equation  for  UQ 
and  normalize. 

(g)  The  "excited  states"  are  (;„  -  (n\yl/z(a*YlUo  with  energies 

(n  +  ~\  Tzw.This  gives  a  complete  set  of  orthonormal  eigen- 

functions. 

(h)  Determine  the  matrix  elements  of  a,  a+,  x  and  p  by  alge- 
braic means.  Show  that  there  are  no  finite  dimensional 
matrices  which  satisfy  the  specified  multiplication  prop- 
erties. 

Chapter  11 

11.1  Prove  the  orthogonality  and  normalization  condition  of  Le- 
gendre   polynomials   using  the  generating  function  (1  -  2rx  + 

r2)-*   =    E  r»PnM. 

n-  0 

11.2  Derive  the  commutation  relations  between  the  components  of 
angular   momentum   and   the  components  of  (a)  the  position 
operator,  and  (b)  the  momentum  operator. 

11.3  Let  Ax,  Ay  and  Az  satisfy  the  commutation  relations  LxAy  - 
AyLx  -  iHAZ9  LVAX  -  AxLy  -  -iHAz,  LrAT  -  Ax  Lx  -  0,  andcyclic 
permutations,  with  respect  to  the  angular  momentum  oper- 
ators.   Such   operators  are  called  "vector  operators";  ex- 
amples are  r  and  p.  Prove  the  following  relations: 

(a)  LXA2  -  A2LK  =  0 

(b)  LX(A  .  L)  -  (A  •  L)LX    --  0 

(c)  L2AX  -   AXL2    -   iH(AvLt  +  LzAy  -  A,LV  -  LVA,) 

(d)L'2(l*Ax  -  A.L2)  -  (L2AX  -  AXL2)L2  -= 

2(L2AX  +  AXL2)  -  4LX(A.L) 

'  >   '  '  i       i   ' 

/^v     i/i  I\/AN"*^  jr  A  \  '*    *    /  r   \  '  "t 

(e)/(/  f  D(^/m    .  (L  .  A)nl(L)lm  . 

11.4  Show  that  in  a  state  v>  with  a  sharp  value  of  L^,  Le<//  -  TO^,  the 
mean  values  of  Lx  and  Ly  are  zero. 

11.5  Suppose  a  system  can  be  resolved  into  two  weakly  interacting 
subsystems  1  and  2  so  that  the  total  angular  momentum  L  is 
LI  +  L.2.    If   the    subsystems  are  in  states  characterized  by 
definite  values  of  the  quantum  numbers  /i,  l\z  and  /2»  h*  re- 
spectively, what  _are  the  possible  values  of  L2  and  what  is  the 
average  value  of  L2? 

11.6  Express  the  spherical  harmonics  for  1  =  0,  1  and  2  as  poly- 
nomials in  x,  y  and  2. 

11.7  Find  the  transformation  rule  of  the  spherical  functions Yn> 
YIQ  and  Y 1-1  for  a  rotation  of  the  coordinate  system  through 
Eulerian  angles  a,  ft  and  y. 
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11.8  Let  L±  -  Lx  ±  iLy.  Verify  the  following  relations: 

(a)  (L±r  -  LT 

(b)  L2  =  L2  +  £<L+L-  +  L-L+) 

(c)  L2L±  -  L±L2  -   ±HL± 

(d)  L+L-  -  L-L+   -  2fiL3 

(e)  L2Lt  -  LfL2  -  0 

(f)  L2L2  -  LsL2  =  0 

11.9  Study  the  way  in  which  the  multiplication  properties  in  prob- 
lem 11.8  can  be  represented  in  terms  of  matrices,  or  what  is 
the  same  thing,  linear  transformations  on  a  finite  dimensional 
vector   space.    Let  V\^  be  a  finite  set  of  degenerate  eigen- 
functions  of  L2  with  eigenvalue  A.  Here  n  labels  the  elements 
of  the  set.  Show  the  following: 

(a)  Lz  VXM  is  an  eigenfunction  of  L2. 

(b)  There  is  some  linear  combination  of  the  Vs  which  is  an 
eigenfunction  of  L^.    Denote  the  eigenvalue  by  mU  and  the 
particular  linear  combination  by  ^\m. 

(c)  L+V\m  is  an  eigenfunction  of  L2  with  eigenvalue  A. 

(d)  LtWXm  is  an  eigenfunction  of  Lz  with  eigenvalue  (m  ±  l)7z. 

(e)  From  the  condition  that  the  norm  of  L±\m  is  nonnegative, 
it    follows    (respectively   for  ±)    that  A  >  m(m  ±  l)^2,    the 
equality  holding  if  the  vector  is  null. 

(f)  The   condition   for   the  existence  of  finite  multipletes  is 
that  the  series  generated  by  (L±)n  terminates  at  both  ends. 
This  implies  that  A  has  the  form  /(/  +  l)7z2,  where  /  may 
have  the  values  0,  1/2,  1,  3/2,  ..„,  and  that  m  runs  over 
range  -/,-/  +  1,  ...,/-  1,  /. 

(g)  Find  the  matrices  of  order  2/  +  1  that  represent  the  action 
of  the  operators  Lz  and  L±  on  the  vector  space  spanned  by 
^/m'  *"'  - m  -  '»    ^or    a    ^xec^  value   of    /.    For   example, 
(lP/m,L2  V/m')  -  ('Vmm'  =  m^mm''    Select  a  phase  convention 
compatiable  with  (11.88). 

(h)  Write  out  the  explicit  matrices  for  /  =  1/2  and  verify  the 
commutation  relations  by  direct  matrix  multiplication. 


Chapter  12 

12.1  Find   the   energy   levels   and   eigenfunctions  of  an  isotropic 
three-dimensional  harmonic  oscillator.    Find  energy  eigen- 
functions that  are  also  eigenfunctions  of  L2  and  L2.  What  is 
the  degeneracy  of  each  level?   What  part  of  the  degeneracy 
stems  from  isotropy  of  the  potential? 

12.2  Write    out  the  explicit  /  =  I,  m  =  1,   Ot  -   1,  eigenfunctions  of 
lowest  energy  of  a  three  dimensional  spherically  symmetric 
harmonic  oscillator.   Express  these  eigenfunctions  as  linear 
combinations  of  the  solution  obtained  by  using  spearation  of 
variables  in  Cartesian  coordinates.   If  each  triplet  of  wave 
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functions  is  orthonormal,  show  that  the  matrix  of  coefficients 
of  the  transformation  is  a  unitary  matrix. 

12.3  For   the   isotropic   potential  V(r)  =  -g8(r  -  a),  determine  the 
range    of  g  for  which  there  is  an  /  =  0  discrete  state  but  no 
/  =  1  discrete  state. 

12.4  In  the  case  of  a  continuous  spectrum,  the  wave  function  far 
from   the  scattering  center  is  characterized  by  the  '  'phase 

shift"  8i(k)  defined  as  follows:   rRkj(r)  -  c  sin  (kr  -  !z  +  S/(fe)) 

for  kr  »  /  (kr  »  1  for  /  =  0),  where  Hk  =  \/2mEk.  For  a  "short 
range"  potential,  S/  is  independent  of  r  (see  page  222  and 
Chapter  28B).  Calculate  the  /  -  0  phase  shift  for  the  potential 
in  problem  12.3. 

(\Vi 
~J  Ji+i&(p)  3  jt(p)  occurs  frequently  and  is 

called  the  "spherical  Bessel  function."  The  second  solution, 
irregular  at  the  origin  p  =  0,  is  n/(p>  =  -W)'  ( — )  ^/.^^P^  con" 

sistent  with  the  definition  of  Neumann  functions,  Np  =  (Jpco8rrp  - 
J.p)/sin77p.  Verify  the  following  properties: 
(a)  differential  equation, 


/,2 
\ 


" 


(b)  explicit  form, 


sin 

~P 


sin  p        cos  p  cos  p         sin  / 

(c)  recurrence  relations, 

+    -        ^    21  +  1  .  .,  _       /  +  1  . 

/-I  p        J;  ,  Jt  p         i 

(d)  asymptotic  form  (p  »  /,  p  »  1), 


•  /  . 
;/(p) 


/  ^     L    /    /77^ 

w/(p)     ^    ___COS^p   -    —J 

(e)  behavior  at   P  =  0, 

2'Z' 

•    /    \  A(*  7 
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(f)  integral  representation  (see  page  202), 

»+i 


1  I 

2  -/-i 


(g)  orthonormality, 

j(/er)r2dr   =    — T  8(k'k') 

12.6  Find  the  conditions  under  which  the  spherical  potential  well 
V  _-_  _|  VQ  |  for   0  •"  r  --  a,    and  V  =  0  for    r  ->  a,    can  support  IV 
s-wave  bound  states,  but  not  Nil. 

12.7  In   the    limit   of    zero  kinetic  energy,  an  s-wave  phase  shift 
behaves  as,  80(k)  -  -ka,  where  a  is  the  "scattering  length." 
Find  an  approximate  relation  between  scattering  length  and 
the  energy  of  a  loosely  bound  state,  KH      0. 

Chapter  13 

13.1  Find  the  momentum  distribution  of  an  electron  in  the  Is,  2s 
and  2p  states  of  hydrogen. 

13.2  Calculate  the  lifetime  of  a  hydrogen  atom  in  the  2p  state. 

13.3  Give  an  expression  for  the  transition  rates  between  consecu- 
tive circular  orbits  in  a  hydrogen-like  atom. 

-1  °°         -1 

13.4  Show  that  (1  -  0     exp|\f/(l  -  01  -   £,  <fr»)     LkMtk  is  the  gen- 

k=  0 

erating  function  for  Leguerre  polynomials  L^(x)  -  Q^(*). 

13.5  At  what  quantum  level  is  a  muonic  atom  the  same  size  as  a 
normal  hydrogen  atom  in  its  ground  state? 

13.6  What  is  the  probability  of  finding  an  electron  with  quantum 
numbers  n,  I  inside  a  nucleus  of  radius  Rn? 

13.7  Show  that  if  the  energy  is  regarded  as  a  complex  variable,  the 
Coulomb    "scattering   amplitude'*  for  a  definite  value  of   /, 

\c  l<t  -  \)/2tk,  has    a    simple    pole  singularity  at  the  energy 
value  corresponding  to  each  level  of  the  discrete  spectrum. 

Chapter  14 

14.1  Evaluate  the  shift  in  energy  levels  of  a  harmonic  oscillator 
produced  by  a  perturbing  5-function  potential  that  is  centrally 
located.  State  the  limit  of  validity  of  the  approximation. 

14.2  Calculate  the  lowest  order  effect  on  the  spectrum  of  a  linear 
harmonic  oscillator  due  to  the  relativistic  increase  in  mass 
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of  the  particle.  In  what  circumstance  is  the  relativistic  effect 
pronounced?  When  is  the  perturbation  approximation  valid? 
Cf.  problem  2.10. 

14.3  Treat  the  Stark  effect  of  the  n  -=  3  level  of  hydrogen.  Deter- 
mine   the   pattern   of  splitting   and   residual  multiplicity  by 
symmetry  arguments.    How  does  the  secular  equation  factor 
in   the  representation  in  which  Lz  is  diagonal?   What  linear 
combinations  of  unperturbed  eigenfunctions  are  energy  eigen- 
f  unctions  in  the  presence  of  the  field? 

14.4  Evaluate  the  static  (o>      0)  electronic  polar  izability  of  a  hy- 
drogen atom  in  its  ground  state. 

14.5  For    the    continuous    part   of  a  spectrum,  the  states  may  be 
labeled  by  the  eigenfunctions  of  the  free  particle  Hamiltonian 
H0   and   an   (exact)    integral   equation  for  the  eigenfunctions 
Wk(r)    of   the    Hamiltonian  f/o  f  V  set  up  along  the  pattern  of 
perturbation   theory.    The  integral  form  of  the  Schrodinger 
equation  is 


Wk(r)    -     exp/k-r 

where  GR  is  the  energy  Green's  function 


, 
4n\r  -  r  | 

characteristic  of  the  free  particle  Hamiltonian.  The  Green's 
function  can  be  defined  in  terms  of  a  complete  set  of  energy 
eigenfunctions  0n(r)  of  the  (any)  Hamiltonian  HO  as  follows: 


(a)  Formulate  a  differential  equation  and  boundary  conditions 
for  the  Green's  function. 

(b)  Evaluate    the   explicit  form  of  the  free  particle  Green's 
function  from  its  definition  in  terms  of  a  complete  set  of 
energy  eigenstates. 

(c)  Find  the  relation  between  the  energy  Green's  function  and 
the  time  Green's  function  Gt.  (See  problem  5.3.) 

(d)  The  effect  of  a  "short  range"  potential  on  the  wave  func- 
tion at  large  distances  from  the  scattering  center  is  de- 
scribed by  the  "scattering  amplitude  f(0)9"  defined  by  the 
asymptotic  form  (for  r  *  °°) 


. 
-   elk' 


(See  Chapter  28,  Section  B)  Derive  an  exact  expression  for 
f(0)  in  terms  of  the  exact  solution  ¥k  of  the  Schrodinger 
equation. 
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(e)  Formulate  a  perturbation  approximation  for  *k  and  f(0)on 

the  basis  of  the  above  results. 

14.6  For  a  definite  value  of  the  angular  momentum  the  integral 
from  of  the  Schrodinger  equation  for  a  spherically  symmetric 
potential  takes  the  form 

uki(r)  =   sinkr  +    —  —    /      gEj(iy')  V(r')uki(r')dr'  , 
n*    JQ 


where 


SEQ(r 


s'}  =  —  -  \sink  \  r  -  r'  |  -  sink  \  r  +  r'  \  L 

K    L  J 


for  if  =  0. 

(a)  Evaluate  gEi(rs')  for  /  >  0. 

(b)  What  is  the  relation  between  the  wave  functions  and  be- 
tween   the    Green's    functions   of  the    three-dimensional 
(problem   14.5)  and  the  partial  wave  formulations.   It  is 
necessary  to  distinguish  ingoing  and  outgoing  wave  bound- 
ary conditions  e±lkr/r.  The  addition  formula  for  spherical 
harmonics  is  helpful, 


21    f    1      m=-/ 

where  cos©  =  cos0  cos0'  +  sintf 
(c)  An  exact  expression  for  the  s-wave  phase  shift  is 


/I 


T00 

/      si 

«/Q 


sinfer 


Verify  this  relation  and  generalize  it  for  I  >  0.  What  nor- 
malization condition  is  implied  for  uk?  See  problem  12.4. 


Chapter  15 

15.1  Identify  the  conditions  under  which  the  Klein-Gordan  equation 
reduces  to  the  Schrodinger.   How  does  the  initial  data  prob- 
lem for  the  second  order  differential  equation  reduce  to  that 
of  the  first  order  equation? 

15.2  Show  that  the  function- space  inner  product 


'/[* 


**•"  I**  f -IT 

is  independent  of  time  if  0  and  0  are  solutions  of  the  Klein- 
Gordan  equation.  Find  the  form  of  the  inner  product  in  mo- 
mentum space.  Is  the  inner  product  positive  definite? 
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15.3  A  general  solution  of  the  Klein-Gordan  equation  is  a  super- 
position  of  a   positive-frequency   and   a  negative-frequency 
part,   with   time   dependence    e~ta)t    and    e*1"',  respectively. 
Show    that  under  a   Lorentz   transformation  that  does  not 
reverse  the  sense  of  time,  a  positive-frequency  solution  is 
transformed  into  a  positive- frequency  solution,  i.e.,  that  the 
decomposition  is  Lorentz  invariant. 

15.4  Find   a   complete  set  of  positive- frequency  solutions  of  the 
Klein-Gordan  equation  that  are  orthonormal  with  respect  to 
the  inner  product  in  problem  15.2.   Show  that  the  complete- 
ness relation  (for  unequal  times)  is 


where    <y  -  <u(k)  -  cVfc2  +  m2c2/^2    and  where  the  Minkowski 
space  inner  product  is  k(x  -x')  =  <^(t  -  t'}  -  k  •  (\  -  \). 


Chapter  16 

16.1  What  is  the  value  of  the  g  factor  of  a  particle  described  by 
the  Hamiltonian  (l/2w0)  [V  -  (p  -  (e/c)  A)]2? 

16.2  A  particle   of  spin  1/2  and  magnetic  moment  p,  moves  in  a 
precessing  magnetic  field 


Hx  =  H  smO  cosa)t,  Hy  =  H  sintfsinojf,  Hz  =  H  cos0. 

At  time  t  -  0,  the  spin  is  parallel  to  the  z  axis.  What  is  the 
probability  of  the  spin  being  antiparallel  to  the  z  axis  at  some 
later  time? 

16.3  Find  the  (time-  dependent)  position  operators  in  the  Heisen- 
berg   representation  of  a  particle  of  spin  1/2  and  magnetic 
moment  u  moving  in  a  nonhomogeneous  magnetic  field 

Hx  =  0,  Hy  =  -fey,   H2  =  HO  +  kz. 

16.4  Determine  the  energy  spectrum  and  wave  functions  of  a  charged 
particle  moving  in  uniform  electric  and  magnetic  fields  that 
are  perpendicular  to  one  another. 


Chapter  17 

17.1  Show   that   the   four  matrices  (/',o')  are  Hermitian,  linearly 
independent  and  form  a  complete  basis  for  2x2  matrices. 

17.2  How  many  matrices  are  required  to  form  a  complete  set  of 
Hermitian,    linearly  independent  NxN  matrices?   How  many 
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mutually   commuting,    Hermitian,    linearly  independent 
matrices  are  there? 

17.3  In  the  Dirac  theory,  a  and  p  are  mutually  commuting  linear 
operators.    Develop  a  notation  for  the  linear  vector  space  on 
which    the    operators  act.    Find  an  expression  for  the  inner 
product  in  the  composite  (direct  product)  space.    Show  that 
the  Dirac  Hamiltonian  is  Hermitian. 

17.4  Use  the  Heisenberg  equation  of  motion  to  show 

(a)  --(x   f  iKp3a/2mc)    -=    p3(p  -  (e/c)\)/m  , 

(b)  ---(/  i   zfipg/2/nc-2)   -   p3(U  -  e<D)/mc2. 

17.5  What  transformation  of  the  spinor  wave  function  must  be  made 
to  restore  the  form  of  the  Dirac  equation  after  a  change  of 
gauge  in  the  electromagnetic  potentials? 

17.6  Show  that  the  form  of  the  Dirac  equation  remains  unchanged 
by  Lorentz  transformation  or  spatial  rotation. 

Chapter  18 

18.1  Suppose  a   system  consists  of   two   weakly  interacting  sub- 
systems each  of  spin  1/2.  The  total  spin  is  S  =  si  +  82.  What 
are  the  possible  eigenvalues  of  S2  and  S2?  Compute  the  value 
of    si  •  s2    in   the   triplet   (spins   parallel)  and  singlet  (spins 
antiparallel)  states  of  the  composite  system.  Find  eigenfunc- 
tions  of  S2  and  S^  as  linear  combinations  of  products  of  eigen- 
functions  of  the  subsystems, 

18.2  An   electron  moving  in  a  central  field  of  force  is  in  a  state 
specified  by  the  quantum  numbers  ljmjm  What  are  the  possible 
values  of  the  z  components  of  orbital  and  spin  angular  mo- 
mentum and  what  is  the  average  value  of  each? 

18.3  Is  the  parity  operator  /  a  linear  operator?  Is  it  Hermitian? 
What  are  the  commutation  properties  of  /  with  the  operators 
r,  P,  L,  S  and  ,1? 

18.4  An  axial  vector  transforms  as  a  vector  under  proper  rota- 
tions   (i.e.,    rotations  without  space  inversion)  but  does  not 
change  sign  under  inversion.  Classify  the  following  as  being 
either   vector   or   axial   vector:     1%    H,    A,   S,  L,  p  and  u  x  v 
where  u  and  v  are  vectors. 

18.5  Show  that  if  a  system  is  in  a  state  characterized  by  a  sharp 
value   mj  for  the  2  component  of  the  total  angular  momentum, 
the  mean  value  of  the  total  angular  momentum  about  an  axis 
z'  making  an  angle  0  with  the  ^  axis  is  m}  cos  0. 

18.6  Evaluate    the    particle   flux   of  positive  energy  and  negative 
energy  plane  wave  solutions  of  the  Dirac  equation.  Also  cal- 
culate the  flux  of  the  corresponding  charge  conjugate  solutions. 

18.7  Can  the  charge  conjugation  transformation  be  represented  by 
a  linear  operator? 
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Chapter  19 

19.1  Evaluate  p  and  j  for  a  Dirac  particle  in  an  electromagnetic 
field.  Verify  that  the  Lorentz  force  is  given  correctly. 

19.2  Calculate  the  spin-orbit  and  contact  potentials  for  an  elec- 
tron outside  a  closed  atomic  core  consisting  of  a  hydrogen- 
like  atom. 

Chapter  20 

20.1  Calculate  the  energy  level  splitting  of  a  hydrogen  atom  in  a 
weak  electric  field.    Neglect  the  Lamb  shift  and  assume  the 
Stark  effect  is  small  in  comparison  with  the  fine  structure. 
Account  for  the  latter  by  using  eigenfunctions  of  J2,  Jz  andL2 
for  the  unperturbed  states. 

20.2  Evaluate  the  Stark  effect  for  the  n  -  2,  j  =  1  level  of  hydrogen 

for  the  case  where  the  Stark  effect  is  comparable  with  the 
Lamb  shift.  Represent  the  Lamb  shift  by  a  phenomenological 
perturbation  matrix  element  that  touches  only  the  s-  state. 

20.3  Study   the   hydrogen   Stark  effect   for   the    level  n  --  2  in  the 
transition  region  where  the  Stark  effect  and  the  fine  structure 
are  of  the  same  order  of  magnitude.  Plot  the  energy  levels 
as  a  function  of  P. 

20.4  Compute  the  lifetimes  of  the  2py  and  2p^  states  of  hydrogen. 

20.5  Find   an  expression  for   the  lifetime  of  the  metastable  2s^ 

state  of  hydrogen  in  a  very  weak  electric  field.  At  what  field 
strength  will  the  lifetime  by  10"3sec?  Is  the  Stark  level  shift 
appreciable  at  this  field  strength?  Hint:  Evaluate  the  matrix 

element  of  r  using  a  perturbed  2s^  wave  function  that  con- 


tains an  admixture  of  the  2p    state  and  take  advantage  of  the 

smallness  of  the  Lamb  shift. 

20.6  Find  the  mean  value  of  the  operator  u  -  L  +  gsS  in  a  state 
characterized  by  the  quantum  numbers  J,  J2,  L,  S.  (This 
gives  a  generalization  of  the  Lande  formula  in  the  case  where 
the  g  factor  of  the  electron  is  not  exactly  equal  to  two.  See 
page  346.)  Hint:  Use  problem  11.3(e). 

Chapter  21 

21.1  Determine  the  form  of  the  contact  interaction  when  the  finite 
nuclear  size  is  taken  into  account.  Calculate  the  splitting  of 

the   2s-*--2pl   levels  of  hydrogen  stemming  from  this  effect. 
It  is  about  -  0.1  Me. 


530  PROBLEMS 

21.2  Calculate  the  hyperfine  splitting  of  the  ls^  state  of  hydrogen 

£i 

using  a  classical  model  in  which  the  nucleus  is  represented 
by  a  uniformly  magnetized  sphere  of  radius  %.  Give  a 
classical  explanation  of  the  sign  of  the  splitting. 

21.3  Find  the  magnetic  field  at  the  nucleus  of  an  s  state  electron. 

21.4  Consider  a  hydrogen  atom  in  the  ground  state  in  a  uniform 
magnetic  field.   Find  the  appropriate  linear  combinations  of 
electron-  proton  spin  wave  functions  that  are  energy  eigen- 
functions  in  the  case  where  the  interaction  with  the  external 
field  is  the  same  order  of  magnitude  as  the  electron-  proton 
dipole-dipole  interaction. 

21.5  Calculate  the  hyperfine  splitting  of  a  hydrogen-like  atom  in  a 
state    of   non-zero   orbital   angular   momentum.    Hint:    Use 
problem  11.3(e). 

Chapter  22 

22.1  Specify  a  complete  set  of  commuting  constants  of  the  motion 
for  positronium. 

22.2  Calculate    the    fine    structure    of  positronium.    Obtain  the 
Hamiltonian  by  semiclassical  arguments.  Account  especially 
for  "hyperfine'  '  splitting,  knowing  that  the  magnetic  moment 
of  the  positron  is  equal  in  magnitude  and  opposite  in  sign  to 
that  of  the  electron.    Present  the  results  in  an  energy  level 
diagram  for  n  =  1,2. 

22.3  Evaluate  the  Lamb  shift  for  positronium. 

22.4  Construct  a  theory  for  positronium  using  a  Dirac  Hamiltonian 
for  each  particle  and  the  Coulomb  interaction  between  them. 
Separate  the  motion  of  the  center  of  mass  from  the  relative 
motion  in  the  approximation  of  retaining  only  the  lowest  order 
relativistic   corrections   for  the   relative  motion.    Does  the 
"hyperfine"    interaction   emerge   automatically?   Hint:    The 
action   of  spin  operators  on  a  product  wave  function  can  be 
expressed  as  follows: 


Chapter  23 

23.1  Construct  a  complete  set  of  orthonormal  two-particle  eigen- 
functions  from  a  complete  set  of  one-particle  eigenf  unctions. 

23.2  Derive  the  Hartree-Fock  equations  for  determining  the  best 
single-particle  functions  to  give  an  antisymmetric  (or  sym- 
metric) two-particle  wave  function  of  lowest  energy. 

23.3  Calculate  the  n  -  1  and  2  levels  of  a  hydrogen-like  atom,  in- 
cluding fine  structure  and  the  Lamb  shift,  using  a  variational 
method. 
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23.4  Show  how   the   variational   method  with  trial  functions  i//  = 
2Ate~a|'r  can  be  reduced  to  a  matrix  procedure  if  the  A's  are 
regarded   as   the   variational   parameters.    The  a's  may  be 
selected  by  intuition  or  by  an  itteration  scheme.   Note  how 
this  gives  a  finite  set  of  orthonormal  vectors. 

23.5  Evaluate  the  ground  state  energy  of  helium  using  the  follow- 
ing "self-consistent"  variational  method:    assume  the  first 
electron  is  in  a  hydrogenic  Is  state  with  Z  =  Z'  and  calculate 
the  screened  field  seen  by  the  second  electron.  Describe  the 
second   electron  by   a   hydrogenic    Is   function  with  Z  =  Z". 
Vary  Z"  to  find  the  lowest  energy  for  a  given  Z'.  Then  impose 
the  symmetry  by  setting  Z'  -  Z". 

Chapter  24 

24.1  Give  the  possible  values  of  the  total  angular  momentum  for 
the  following  states  (terms):   1S,  2S,  3S,  2P.  3P,  2D  and4D. 

24.2  Which  terms  are  possible  for  the  following  two-electron  con- 
figurations:    (a)     nsn's,    (b)     nsn'p,    (c)   nsn'd,  and  (d)  np,  n'p? 
Which   terms  are  consistent  with  the  exclusion  principle  if 
»  =  »'? 

24.3  Couple  three  unit  angular  momenta  /i  =  /2  =  /a  =  1  to  yield  a 
resultant  eigenstate  L  =  1,  L*  =  0  of  L2,  where  L  =  li  +  \2  +  la. 
How  many  independent  states  of  this  sort  are  there? 

24.4  Estimate   the  low-lying  excited  state  energies  of  helium  in 
the   approximation   of  neglecting   exchange  effects.    Do  this 
assuming   that  one  electron  is  described  by  the   Is  function 
found   in   the  ground   state  calculation  and  carrying  out  the 
variational  procedure  for  a  hydrogenic  2s  (and  independently, 
2p)  wave  function  in  the  screened  Coulomb  field.  The  2s  state 
must  be  taken  orthogonal  to  some  appropriate  Is  state.  The 
root   of   the   variational  equation  may  be  found  by  a  rapidly 
converging  iteration  method. 

24.5  Calculate  the  low-  lying  excited  state  energies  of  helium  taking 
into   account  exchange   and  spin  effects.   Use  appropriately 
symmetrized   product   eigenfunctions   for  the  configurations 
(Is,    2s)   and  (Is,   2p),  where  the   Is  function  is  that  found  in 
the  ground  state  calculation.    The  required  matrix  elements 
are  given  in  problem  24.6  for  reference. 

24.6  Consider  the  hydrogenic  wave  functions 


.   6(1  - 


where  a  -  Zis/ao,  /3  -  Z2S/2ao,  and  y  -  Z2p/2ao  and  where  6,  B 
and  c  are  determined  by  the  orthogonality  and  normalization 
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conditions.  Verify  the  following  one-  and  two-particle  matrix 
elements: 

T2p  *  <<P2p.  T<P2p>   -    ?L£ 


I/  A     „  2  >7 

Vgp    -    -  2  eOZY 

f'2a2y2<a2  +  5ay  +  lOy2) 
6 (a  +  y)5 
28e2a2y2 
3  (a   I-  y)5 

8_2    h   _  £ 

6      h    a2  -aft  +  ^2 


a2  -   a^  ^    /32, 
°  "  \          <u  +  ^)2         2(a   h  /3)3(a2  -  a/6  +  j32)  / 


(a  +  /S)5(a     -  aft  +  ^8) 

24.7  Calculate  the  ground  state  of  lithium  by  means  of  a  variational 
method  based  on  an  an ti symmetrized  product  wave  function 
for  the  configuration  (Ls22s).  The  appropriate  matrix  elements 
can  be  reduced  to  those  given  in  problem  24.6. 

Answer:  The  best  variational  wave  function  is  given  by 
Zis  =  2.694  and  Z2s  =  1.534.  The  energy  is 
-  200.8  eV  compared  with  the  perturbation  theory 
value  —  190.8  eV  and  the  observed  value-  202.54 
eV. 

24.8  Calculate  the  (Is22p)  excited  state  of  lithium  by  the  variational 
method,    (a)    neglecting    exchange   effects   and   assuming   a 
helium   core,    (b)  neglecting  exchange  effects  and  using  the 
Ls2  core  found  in  problem  24.7  and  (c)  taking  exchange  effects 
into  account  using  the  antisymmetrized  wave  function  for  the 
configuration  (Ls22p). 

Chapter  25 

25.1  Show  that  the  virial  theorem  2T  \  Ve-e  +  Vn-c  -  0  is  fulfilled 
in  the  Thomas- Fermi  model.  Does  it  also  hold  in  the  varia- 
tional solution? 
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25.2  Using   the   virial  theorem,  show  that  in  the  Thomas- Fermi 
model   of  a  neutral  atom  the  energy  of  electrostatic  repul- 
sion between  the  electrons  is  1/7  the  magnitude  of  the  elec- 
trostatic attraction  between  the  electrons  and  the  nucleons. 

25.3  Estimate  the  order  of  magnitude  of  the  following  quantities 
in  a  neutral  atom  according  to  the  Thomas- Fermi  model: 

(a)  the  size  of  the  atom, 

(b)  the  average  electrostatic  repulsion  between  two  electrons, 

(c)  the  average  kinetic  energy  of  one  electron, 

(d)  the  average  speed  of  an  electron, 

(e)  the  average  angular  momentum  of  an  electron, 

(f)  the  mean  radial  quantum  number. 

25.4  Show  that  the  mean  perturbation  of  all  states  of  a  given  term 
is  zero  for  the  spin- orbit  interaction. 

25.5  Calculate  the  L  •  S  splitting  of  the  3P  term  of  helium. 

25.6  Evaluate  the  L  •  S  splitting  of  a  s ingle  np  electron  in  a  spheri- 
cally symmetric  potential.  Express  the  result  in  terms  of  an 
arbitrary  radial  matrix  element.  Estimate  the  radial  matrix 
element  for  the  doublet  splitting  of  sodium. 

25.7  Two  electrons  move  in  an  (np)2  configuration  in  a  spherically 
symmetrical  potential.  Regarding  the  electrostatic  repulsion 
between  the  electrons  as  a  perturbation,  evaluate  the  splitting 
of  terms    in   first   order   approximation.    Neglect  the  spin- 
orbit    interaction.    Use   qualitation   considerations   to   infer 
ordering  of  the  terms.    Hint:  To  facilitate  diagonalization  of 
the  secular  equation,  use  a  representation  in  which  M^  and  MS 
are  diagonal  and  note  that  the  sum  of  the  roots  of  a  secular 
equation  is  equal  to  the  sum  of  the  diagonal  matrix  elements. 


Chapter  26 


26.1  Find  the  possible  atomic  terms  (a  term  is  characterized  by 
L   and   S)    in   a   configuration  of  two  equivalent  d- electrons. 
Give  the  total  number  of  states  and  the  number  of  states  in 
each  term.  What  values  of  J  are  possible  to  each  term? 

26.2  In    an    atomic    configuration   the    term   that   has    the  lowest 
energy  can  be  determined  by  Hund's  semi-empirical  rules: 

(1)  The   ground   state   will   have  the  largest  value  of  S  con- 
sistent with  the  Paul!  principle, 

(2)  L  will  have  the  largest  value  consistent  with  the  value  of 
S  determined  in  rule  (1), 

(3)  The   total   angular  momentum  of  the  ground  state  is  J  = 
|L  -  S|    if   the   unfilled   subshell  is  half  full  or  less  than 
half  full;  it  is  J  -  L  4  S  if  the  subshell  is  more  than  half 
full. 

Give  a  qualitative  physical  justification  for  each  of  these  rules. 
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26.3  Using  Hund's  rules  (see  problem  26.2),  find  the  ground  state 
of  the  configuration  np*  for  x  =  1,  2,  . . . ,  6,  For  each  value 
of  x,  state  an  element  for  which  this  case  is  realized  physi- 
cally. 

26.4  Do  problem  26.3  for  the  configuration  ndx  for  x  =  1,  2,  . . . ,  10. 


Chapter  27 

27.1  Estimate  the  relative  frequencies  and  separation  of  energy 
levels  for  the  electronic,  vibrational  and  rotational  motions 
of  a  diatomic  molecule. 

27.2  Derive  the  Schrodinger  equation  describing  the  motion  of  the 
nuclei  of  a  diatomic  molecule  in  the  approximation  that  the 
nuclei  move  much  more  slowly  than  the  electrons  and  thus 
experience   only   an  interaction  with   the   electrons   that  is 
averaged   over   many   electron  revolutions.    This  procedure 
provides  a  separation  of  variables  between  the  electronic  and 
nuclear  motions.    The  approximation  is  called  the  adiabatic 
or  Born-Oppenheimer  approximation. 

27.3  What  are  the  possible  symmetry  states  of  the  diatomic  mole- 
cules D2,  N2,  LiH  formed  from  the  bonding  of  the  two  atoms 
in  their  ground  states? 

27.4  What  spin  symmetries  are  possible  for  the  rotational  states 
of  the  deuterium  molecule  D2  in  the  electronic  ground  state? 
The  deuterium  nucleus  has  spin  1. 

27.5  Calculate  the  energy^of  a  rigid  electric  dipole  in  a  uniform 
electric  field.  Use  second  order  perturbation  theory. 

27.6  Show   that   the   force  between   two   hydrogen  atoms  in  their 
ground  states  varies  as  l/R7  if  the  atoms  are  separated  by  a 
large  distance  R. 


Chapter  28 

28.1   The  rate  for  making  transitions  from  an  initial  state  i  to  the 
final  state  f  is 

2/7  lir 

Wlf=  T|W' 

where  pf  is  the  number  of  final  states  per  unit  energy  interval 
("Golden  Rule  No.  2"). 

(a)  Find   the   expression  for  U'lf  in  first  order  perturbation 
theory.   What  normalization  convention  is  implied  for  the 
wave  function? 

(b)  Derive  an  expression  for  M^  in  second  order  perturbation 
theory. 
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(c)  Evaluate  the  density  of  states  pf  for  a  final  state  consisting 
of  two  free  particles  of  definite  total  energy  and  momentum. 

(d)  Evaluate  pf  for  a  final  state  consisting  of  three  equal  mass 
(free)  particles.  Express  the  results  in  the  center  of  mass 
system.    How  is  the  partition  of  energy  among  the  three 
particles  accounted  for  by  a  probability  distribution? 

(e)  Calculate  the  dependence  of  pf  on  energy  near  the  energy 
threshold  for  an  N-particle  final  state. 

28.2  Calculate   the   rate  for  induced  transitions  from  state   m  to 
state  n  of  an  atomic  system  in  an  electric  field  with  spectral 

density  8  (o>)  at  transition  frequency  <»  =  1  (Em  -  En).  Identify 

n 

the  Einstein  coefficient  B  and  thereby  infer  the  rate  for 
spontaneous  emission. 

28.3  Consider  a  particle  of  mass  m  bound  in  a  three-dimensional 

harmonic  oscillator  potential  ~kr29  The  particle  is  irradiated 

z 

with  a  mild  pulse  &'  -  ex2e~(t/r^  for  the  time  interval  -«  < 
t  <  oo.  Determine  which  transitions  are  possible  and  calculate 
their  probabilities.  Identify  the  limiting  cases  of  sudden  and 
adiabatic  perturbations.  State  the  limits  of  validity  of  the 
perturbation  calculation. 

28.4  At    t   =  Of  a  hydrogen  atom  in  its  ground  state  is  irradiated 
with  a  uniform  periodic  electric  field.  Determine  the  mini- 
mum frequency  of  the  field  necessary  to  ionize  the  atom  and 
compute  the  ionization  probability  per  unit  time.  As  an  ap- 
proximation, the  electron  in  the  final  state  may  be  regarded 
as  free.  State  the  limits  of  validity  of  the  approximation. 

28.5  Show   that  the  scattering  of  slow  particles  in  a  short  range 
potential  is  characterized  by  dt(k)  ~  k2l+l.  Find  the  proportion- 
ality constant  in  Born  approximation  and  state  the  conditions 
under  which  the  approximation  is  valid. 

28.7  Taking  into  account  the  symmetry  of  the  wave  function,  give 
the  differential  cross  section  for  elastic  Coulomb  scattering 
of  an  electron  on  an  electron  and  of  an  alpha  particle  on  an 
alpha  particle.    Distinguish   spin   states   and   also  give  the 
formula  for  the  scattering  of  unpolarized  electrons.  Identify 
quantum  effects  and  show  how  they  disappear  in  the  classical 
limit.    For  reference,  the  exact  scattering  amplitude  for  a 
fixed  Coulomb  potential  is 

fc(0)    =    _         X2         .    Fd  +  iy)    ^  e-2i-ytn  sm  0/2 
2sm2|    "    HI-  iy)   " 

£t 

where  y  =  e$/Hv.  Cf.  Chapter  13,  Section  D. 

28.8  Compute   in   Born   approximation  the  differential  scattering 
cross  section  for  the  scattering  of  fast  neutrons  by  a  Coulomb 
field. 
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28.9   Set   up  coupled  Schrodinger  equations  to  describe  the  "two 
channel"  reaction  and  scattering  processes 

(11)  a  +  b    >  a  +  b  , 

(12)  a  +  b    >  c  +  d, 
(22)  c  }  d    *  c  +  rf, 

(21)  c  +  d  -»  a  +  6. 

Assume  the  particles  are  spinless  and  of  unequal  mass. 
Separate  out  the  motion  of  the  center  of  mass. 

(a)  How   must  the   potentials    Vi  1,^12,  ^21,^22  be  related  in 
order  to  describe  a  system  that  is  invarient  with  respect 
to  reversal  of  the  sense  of  time?  How  are  the  cross  sec- 
tions for  reactions  (12)  and  (21)  related?  (detailed  balance) 

(b)  Give  expressions  for  the  effective  cross  sections  for  these 
reactions  in  Born  approximation. 

(c)  Formulate  an  expression  for  the  reaction  and  scattering 
amplitudes  in  terms  of  partial  wave  amplitudes  of  definite 
angular  momentum.  What  relation  among  fi  i  J\  2,  (  21^22  (f°r 
a  definite  value  of  /)  is  implied  by  conservation  of  prob- 
ability? 

Note:  The  results  are  presented  most  simply  in  terms  of  a 
matrix  notation.  The  many-channel  generalization  of  the 
quantity  e>2'5/  is  called  the  S-  matrix. 

Chapter  29 

29.1    Verify  the  antic  ommutation  relations 

0a<r,f)0£<P;«)   +    ^(K/)<k<r,f)    -    /tt/,S<r-f) 

y^(r,0i/^(r,'*)    i-    ^(^t)^^)   --    0 

satisfied  by  the  second  quantized  Dirac  wave  function.  The  a 
and  ft  indicate  components  of  the  Dirac  spinor  and  /a/?  is  the 
unit  4x4  matrix.  The  positive-frequency  parts  of  <//  and  i//+ 
describe  the  creation  of  positrons  and  electrons  .respectively, 
while  the  negative-  frequency  parts  describe  the  annihilation  of 
electrons  and  positrons,  respectively.  Electrons  and  positrons 
are  described  by  independent  sets  of  mutually  anticommuting 
creation  and  annihilation  operators 

-   0  , 


0  . 

29.2  Evaluate  the  commutation  relations  between  components  of 
the  current  and  charge  density  of  a  quantized  Dirac  wave 
field,  i.e.,  evaluate  j^{r,0jv(i#)  -  7v(r;0;M(r,0  at  equal  times. 
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29.3  Calculate    the    Born   approximation   matrix  element  for  the 
production  of  an  electron- positron  pair  in  an  external  electric 
field 

0(r,£)    =    <I>0  COS  kt  COS  a)t . 

29.4  Express  the  energy  and  momentum  operators  of  a  quantized 
Dirac  wave  field  in  terms  creation  and  distruction  operators. 

29.5  Let  M(x)  be  a  massive  pseudoscalar  meson  field  satisfying 
the  Klein-Gordan  equation.    Calculate  the  force  between  two 
stationary   Dirac    "nucleons"    in  second  order  perturbation 
theory,    assuming   the    interaction  energy  between  the  "nu- 
cleon"  and  meson  fields  is 


H'  -  g  /  i/ 


